o) Springer 


. 


> «@ 


= nae “* 


ite ed 


Graduate Texts in Contemporary Physics 


Series Editors: 


R. Stephen Berry 
Joseph L. Birman 
Mark P. Silverman 
H. Eugene Stanley 
Mikhail Voloshin 


Graduate Texts in Contemporary Physics 


S.T. Ali, J.P. Antoine, and J.P. Gazeau: Coherent States, Wavelets and 
Their Generalizations 


A. Auerbach: Interacting Electrons and Quantum Magnetism 

N. Boccara: Modeling Complex Systems 

T.S. Chow: Mesoscopic Physics of Complex Materials 

B. Felsager: Geometry, Particles, and Fields 

P. Di Francesco, P. Mathieu, and D. Sénéchal: Conformal Field Theories 

A. Gonis and W.H. Butler: Multiple Scattering in Solids 

K. Gottfried and T-M. Yan: Quantum Mechanics: Fundamentals, 2nd Edition 
K.T. Hecht: Quantum Mechanics 


J.H. Hinken: Superconductor Electronics: Fundamentals and 
Microwave Applications 


J. Hladik: Spinors in Physics 

Yu.M. Ivanchenko and A.A. Lisyansky: Physics of Critical Fluctuations 
M. Kaku: Introduction to Superstrings and M-Theory, 2nd Edition 
M. Kaku: Strings, Conformal Fields, and M-Theory, 2nd Edition 
H.V. Klapdor (ed.): Neutrinos 


R.L. Liboff (ed): Kinetic Theory: Classical, Quantum, and Relativistic 
Descriptions, 3rd Edition 


J.W. Lynn (ed.): High-Temperature Superconductivity 
H.J. Metcalf and P. van der Straten: Laser Cooling and Trapping 


R.N. Mohapatra: Unification and Supersymmetry: The Frontiers of 
Quark-Lepton Physics, 3rd Edition 


V.P. Nair: Quantum Field Theory: A Modern Perspective 


R.G. Newton: Quantum Physics: A Text for Graduate Students 
H. Oberhummer: Nuclei in the Cosmos 

G.D.J. Phillies: Elementary Lectures in Statistical Mechanics 
R.E. Prange and S.M. Girvin (eds.): The Quantum Hall Effect 


(continued after index) 


V. Parameswaran Nair 


Quantum Field Theory 


A Modern Perspective 


With 100 Illustrations 


g) Springer 


V. Parameswaran Nair 

Physics Department 

City College 

Convent Avenue & 138" Street 
New York, NY 10031 


USA 

Series Editors 

R. Stephen Berry Joseph L. Birman Mark P. Silverman 
Department of Chemistry § Department of Physics Department of Physics 
University of Chicago City College of CUNY Trinity College 
Chicago, IL 60637 New York, NY 10031 Hartford, CT 06106 
USA USA USA 

H. Eugene Stanley Mikhail Voloshin 

Center for Polymer Studies Theoretical Physics Institute 

Physics Department Tate Laboratory of Physics 

Boston University The University of Minnesota 

Boston, MA 02215 Minneapolis, MN 55455 

USA USA 


On the cover: The pinching contribution to the interaction between fermions. See page 213 
for discussion. 


Library of Congress Cataloging-in-Publication Data 
Nair, V. P. 
Topics in quantum field theory / V.P. Nair. 
p. cm. 
Includes bibliographical references and index. 
ISBN 0-387-21386-4 (alk. paper) 
1. Quantum field theory. I. Title. 
QC174.45.N32 2004 
530.14’3—dce22 2004049910 


ISBN 0-387-21386-4 Printed on acid-free paper. 


© 2005 Springer Science+Business Media, Inc. 

All rights reserved. This work may not be translated or copied in whole or in part without the written 
permission of the publisher (Springer Science+Business Media, Inc., 233 Spring Street, New York, NY 
10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connec- 
tion with any form of information storage and retrieval, electronic adaptation, computer software, or by 
similar or dissimilar methodology now known or hereafter developed is forbidden. 

The use in this publication of trade names, trademarks, service marks, and similar terms, even if they 
are not identified as such, is not to be taken as an expression of opinion as to whether or not they are 
subject to proprietary rights. 


Printed in the United States of America. (MVY) 
987654321 SPIN 10955741 


springeronline.com 


To the memory of my parents 


Velayudhan and Gowrikutty Nair 


—a == & 


joneaaghs 


'—@ 


7 


Preface 


Quantum field theory, which started with Dirac’s work shortly after the dis- 
covery of quantum mechanics, has produced an impressive and important 
array of results. Quantum electrodynamics, with its extremely accurate and 
well-tested predictions, and the standard model of electroweak and chromo- 
dynamic (nuclear) forces are examples of successful theories. Field theory has 
also been applied to a variety of phenomena in condensed matter physics, in- 
cluding superconductivity, superfluidity and the quantum Hall effect. The 
concept of the renormalization group has given us a new perspective on field 
theory in general and on critical phenomena in particular. At this stage, a 
strong case can be made that quantum field theory is the mathematical and 
intellectual framework for describing and understanding all physical phenom- 
ena, except possibly for quantum gravity. 

This also means that quantum field theory has by now evolved into such 
a vast subject, with many subtopics and many ramifications, that it is im- 
possible for any book to capture much of it within a reasonable length. While 
there is a common core set of topics, every book on field theory is ultimately 
illustrating facets of the subject which the author finds interesting and fas- 
cinating. This book is no exception; it presents my view of certain topics in 
field theory loosely knit together and it grew out of courses on field theory 
and particle physics which I have taught at Columbia University and the City 
College of the CUNY. 

The first few chapters, up to Chapter 12, contain material which gener- 
ally goes into any course on quantum field theory although there are a few 
nuances of presentation which the reader may find to be different from other 
books. This first part of the book can be used for a general course on field 
theory, omitting, perhaps, the last three sections in Chapter 3, the last two 
in Chapter 8 and sections 6 and 7 in Chapter 10. The remaining chapters 
cover some of the more modern developments over the last three decades, 
involving topological and geometrical features. The introduction given to the 
mathematical basis of this part of the discussion is necessarily brief, and these 
chapters should be accompanied by books on the relevant mathematical top- 
ics as indicated in the bibliography. I have also concentrated on developments 
pertinent to a better understanding of the standard model. There is no dis- 
cussion of supersymmetry, supergravity, developments in field theory inspired 
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by string theory, etc.. There is also no detailed discussion of the renormal- 
ization group either. Each of these topics would require a book in its own 
right to do justice to the topic. This book has generally followed the tenor 
of my courses, referring the students to more detailed treatments for many 
specific topics. Hence this is only a portal to so many more topics of detailed 
and ongoing research. I have also mainly cited the references pertinent to the 
discussion in the text, referring the reader to the many books which have 
been cited to get a more comprehensive perspective on the literature and the 
historical development of the subject. 

I have had a number of helpers in preparing this book. I express my ap- 
preciation to the many collaborators I have had in my research over the years; 
they have all contributed, to varying extents, to my understanding of field 
theory. First of all, I thank a number of students who have made sugges- 
tions, particularly Yasuhiro Abe and Hailong Li, who read through certain 
chapters. Among friends and collaborators, Rashmi Ray and George Thomp- 
son read through many chapters and made suggestions and corrections, my 
special thanks to them. Finally and most of all, I thank my wife and long 
term collaborator in research, Dimitra Karabali, for help in preparing many 
of these chapters. 


New York V. Parameswaran Nair 
May 2004 City College of the CUNY 
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1 Results in Relativistic Quantum Mechanics 


1.1 Conventions 


Summation over repeated tensor indices is assumed. Greek letters ju, v, etc., 
are used for spacetime indices taking values 0, 1, 2,3, while lowercase Roman 
letters are used for spatial indices and take values 1, 2, 3. 

The Minkowski metric is denoted by n,. It has components jo9 = 1, mij = 
—6ij, Noi = 0. We also use the abbreviation 0, = sor. The scalar product 
of four-vectors A, and B, is A- B = ApgBo — A;B;. Such products between 
momenta and positions appear often in exponentials; we then write it simply 
as px. It is understood that this is ppz9 — p- x, where the boldface indicates 
three-dimensional vectors. 

The Levi-Civita symbol e€ 
indices, and ¢!?3 = 1. ¢€#”° is similarly defined with € 

Two spacetime points x, y are spacelike separated if (2 — y)? < 0. This 
means that the spatial separation is more than the distance which can be 
traversed by light for the time-separation |x° — y°|. 

O is also used to denote the boundary of a spatial or spacetime region; 
i.e, OV and O» are the boundaries of V and », respectively. 

We will now give a resumé of results from relativistic quantum mechanics. 
They are merely stated here, a proper derivation of these results can be 
obtained from most books on relativistic quantum mechanics. 


“Jk is antisymmetric under exchange of any two 
Oz il 


1.2 Spin-zero particle 


We consider particles to be in a cubical box of volume V = L?, with the 
limit V — oo taken at the end of the calculation. The single particle wave 
functions for a particle of momentum k can be taken as 


e tke 


whe) = a 


where w, = V/k-k-+m2. We choose periodic boundary conditions for the 
spatial coordinates, i.e., ux(z + L) = ux(x) for translation by L along any 
spatial direction; therefore the values of k are given by 


(1.1) 
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27, 
ki, = — iz 
a (1.2) 
(ni, 72,3) are integers. The wave functions ux(x) obey the orthonormality 
relation 


i d®x [ut(iOyus:) — Gree eae (1.3) 

V : 

where 6,4 denotes the Kronecker 6’s of the corresponding values of n,;’s, i.e., 
Ok,k! = Oni n4, Ono,n/,Ong,n', (1.4) 


In the limit of V — oo, we have 


3 
Okk! > Gar 63) (k — k’) (1.5) 


d?k 


The completeness condition for the momentum eigenstates |k) can be written 
as 


[ie Seage H =2 i 


where kp = wr. 
The wave functions uz are obviously solutions of the equation 


jun 2 / aye (1.8) 


The differential operator on the right-hand side is not a local operator; it has 
to be understood in the sense of 


J-V? + m? f(a) = be JK? +m? f(k (1.9) 


— 


where 


se) = [ FR F0) (1.10) 


One can define a local differential equation for the ux’s; it is the Klein-Gordon 
equation 

(O + m?)u(z) = 0 gla) 
where [] is the d’Alembertian operator, 0 = 0,04 = (09)? — V?. 

One can take the Klein-Gordon equation as the basic defining equation 
for the spinless particle and construct u,(x) as solutions to it. The inner 
product is then determined by the requirement that it be preserved under 
time-evolution according to the Klein-Gordon equation. The inner product 
for functions u, v obeying the Klein-Gordon equation is thus given by 


1.3 Dirac equation a 
(ulv) = pee [u*tOov — iOou™ v] (1.12) 
The timé-derivative of this gives 
Oo(ulv) = [ee [Oow*idov — idpu* Oov + iu*d2u — idgu"v] 
= pes [iu*(V? — m?)v — i(V? — m?)u*v] 
== pee V - [u"iVu —iVu* v] 
= dS - [u*iVu —iVu* v] 
OV 

= (1.13) 


The last equality follows from the periodic boundary conditions. We see that 
this inner product is-preserved by time-evolution according to the Klein- 
Gordon equation; this is the reason that (1.12) is the correct choice and (1.3) 
is the correct form of the orthonormality condition to be used for this case. 


1.3 Dirac equation 


The basic variables are W(x), r = 1,2,3,4, which can be thought of as a 
column vector. Each W,(z) is a complex function of space and time. The 
Dirac equation is given by 


(—i7#,0, + m6,5)¥s(x) = 0 (1.14) 
This can be written in a matrix notation as 
(—iy"d, + m1)¥(r) =0 (1.15) 


Here 1 denotes the identity matrix , 1 = 6,,. y” are four matrices obeying 
the anticommutation rules, or the Clifford algebra relations, 


tay’ + y’yh = 2n¥1 (1.16) 


One set of matrices satisfying these relations is given by 


ra(83), (22) am 


The identity in the above expression for 7° is the 2 x 2-identity matrix. The 
gamma matrices are 4 x 4-matrices. a’ are the Pauli matrices. 


taro 2. (0 —4 3 {1 0 
o = ae eo we oa 4 (1.18) 
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Clearly, a similarity transform of the above set of ‘y’s will also obey the 
Clifford algebra. The fundamental theorem on Clifford algebras states that 
the only irreducible representation of the y-matrices is given by the above 
set, up to a similarity transformation. 

The Lagrangian for the Dirac equation is 


L=W(iy-d0-—m)v ; (1.19) 
W is related to the conjugate of W as 
w= pty (1.20) 
The Lorentz transformation of the Dirac spinor is given by 
W' (xz) = S$ W(L“z) (1.21) 


where x’ = (L)#zx” is the Lorentz transformation of the coordinates. In- 
finitesimally, z2’’ ~ zc + whx’, where w+” = —w” are the parameters of the 
Lorentz transformation. The transformation of the spinors is then given by 


We) = (1 = Su" My) H(2) : (1.22) 


Muy =t(tpO, —ZvOn) + Spy (1.23) 


Spy is the spin term in M,,, 


1 
Spy = — ql W] (1.24) 


By evaluating Siz = S3, one can check that W corresponds to spin T Some 
further details on relativistic transformations are given in the appendix. 

There are two types of plane wave solutions, those with pp = \/p? + m? = 
E, and those with pop = —Ep = —\/p* + m?. They can be written as 


D(a) = uy(p) e~** = up(p) eB HP® (1.25) 
for the positive-energy solutions and 
W(x) = v,(p) e* = v,(p) 22" —P2 (1.26) 


for the negative-energy solutions. In these equations we have written the signs 
explicitly in the exponentials, so that po in px is E for both cases. 
The spinors u,(p), v-(p), r= 1,2, are given by 


Ur(p) = B(p)wr, vr (p) = B(p)w, : (1.27) 


where 
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tf 0 0 0 

0 1 C 0 _ 0 
m=]: ua =) We ti=|, 4, w= |/F (1.28) 

0 0 0 1 

and 
E+m o-Dp 
2m J 2m(E +m) 

B(p) = 1.29 
(p) ae ae (1.29) 


V 2m(E +m) 2m 


Here E = ,/p? +m? and we have used the representation for the gamma 
matrices given earlier. 

It is easily seen that B(p) is the boost transformation which takes us 
from the rest frame of the particle to the frame in which it has velocity 
v' = p'/E. From the Lorentz transformation properties, it is clear that Viw 
is not Lorentz invariant. So we have chosen a Lorentz invariant normalization 
for the wave functions 


u,(p)us(p) = Sra; 0, (p)Us (p) = —Ors (1.30) 
Using the definition of B(p), we can establish the properties 
+m = “D—m 
ye ur(p)t,(p) = “ — amy aie) Ye vr (p)d,-(p) = Wee (1.31) 


The completeness relation for the solutions is expressed by 


>_ ur(p)tr(p) — vr (p)O-(p) = 1 (1.32) 


r 


Further 


tir(p)Hte(p) = © bro = Oe (p)7"V0(0) (1.33) 
ul(p)vs(p) = vu} (p)us(—p) = 0 (1.34) 
The chirality matrix ys is defined by 
eas ht tt 
= Fewae” aya? (1.35) 


In the explicit representation of y-matrices given above 


me () i (1.36) 


Another useful representation is 
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IAalg i | (1.37) 


The left and right chirality projections are defined by 
1 1 
Vy = 5 + 75)¥, VR = g(t =e (1.38) 


They correspond to eigenstates of y5 with eigenvalues +1, respectively. 
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2 The Construction of Fields 


2.1 The correspondence of particles and fields 


Ordinary point-particle quantum mechanics can deal with the quantum de- 
scription of a many-body system in terms of a many-body wave function. 
However, there are many situations where the number of particles is not 
conserved, e.g., the @-decay of the neutron, n — p+e-+,. There are also 
situations like e*e~ —+ 27 where the number of particles of a given species 
is not conserved, even though the number of particles of all types taken to- 
gether is conserved. In order to discuss such processes, the usual formalism 
of many-body quantum mechanics, with wave functions for fixed numbers of 
particles, has to be augmented by including the possibility of creation and 
annihilation of particles via interactions. The resulting formalism is quantum 
field theory. 

In many situations such as atomic and condensed matter physics, a nonrel- 
ativistic description will suffice. But for most applications in particle physics 
relativistic effects are important. Relativity necessarily brings in the possi- 
bility of conversion of mass into energy and vice versa, i.e., the creation and 
annihilation of particles. Relativistic many-body quantum mechanics neces- 
sarily becomes quantum field theory. Our goal is to develop the essentials of 
quantum field theory. 

Quite apart from the question of creation and annihilation of particles, 
there is another reason to discuss quantized fields. We know of a classical field 
which is fundamental in physics, viz., the electromagnetic field. Analyses by 
Bohr and Rosenfeld show that there are difficulties in having a quantum 
description of various charged particle phenomena such as those that occur 
in atomic physics while retaining a classical description of the electromagnetic 
field. One has to quantize the electromagnetic field; this is independent of any 
many-particle interpretation that might emerge from quantization. Similar 
arguments can be made for quantizing the dynamics of other fields also. 

There are two complementary approaches to field theory. One can postu- 
late fields as the basic dynamical variables, discuss their quantum mechanics 
by diagonalization of the Hamiltonian operator, etc., and show that the re- 
sult can be interpreted in many-particle terms. Alternatively, one can start 
with point-particles as the basic objects of interest and derive or construct 
the field operator as an efficient way of organizing the many-particle states. 
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We shall begin with the latter approach. We shall end up constructing a field 
operator for each type or species of particles. Properties of the particle will 
be captured in the transformation laws of the field operator under rotations, 
Lorentz transformations, etc. The one-to-one correspondence of species of 
particles and fields is exemplified by the following table. 


Particle 
Spin-zero bosons, o(a,t), ¢ is a real scalar field 
Charged spin-zero bosons g(a, t), ¢ is a complex scalar field 


Photons (spin-1, massless bosons) A,, (a, t), real vector field 
(Electromagnetic vector potential) 


Spin-$ fermions (e+, quarks, etc.) w,(x, t), a spinor field 


The simplest case to describe is the theory of neutral spin-zero bosons, so we 
shall begin with this. 


2.2 Spin-zero bosons: construction of the field operator 
We consider noninteracting spin-zero uncharged bosons of mass m. The wave 


function u;(x) for a single particle of four-momentum k,, was given in Chapter 
1. With the box normalization, 


(2.1) 


The states of the system can evidently be represented as follows. 


|0) = vacuum state, state with no particles. 


|1.) = |k) = one-particle state of momentum k, energy ko = V k* + m2 = wp. 
[1k,, lk.) = |ki,k2) = two-particle state, with one particle of momentum k, 
and one particle of momentum k2, with corresponding energies. 

|X, k2,---) = Many-particle state, with Nk, particles of momentum kj, nx, 
particles of momentum kg, etc. 


We now introduce operators which connect states with different numbers 
of particles. It is sufficient to concentrate on states |0),|1,),|2x),.../2%) with 
a fixed value of k, introduce the connecting operators and then generalize to 
all k. We thus define a particle annihilation operator a, by 
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ax|N~) = On|n_, — 1) (2.2) 
Since the vacuum has no particles, we require 
a;,|0) = 0 (2.3) 
The many-particle states are orthonormal, i.e., 
(0/0) = (1e|1k) = (24|2k) =... = 1 (2.4) 
(nk|nj,) = 0, ne # Ny (2.5) 
From (2.2), we can then write, omitting the subscripts k for a while, 
(n — 1la|n) = an (2.6) 
Since (7)|A¢) = (Alw|¢) for an operator A, (2.6) gives 
(al(n —1)|n) = an (2:7) 


This shows, with the orthogonality (2.5), that at|n—1) must be proportional 
to |n). Thus a! is a particle creation operator and we may write, from (2.7), 


at|n) = of 4 ,\n+1) (2.8) 
The operators aa! and ata are diagonal on the states. We have 
ata|n) = |an|? |n) (2.9) 


Further, a'a|0) = 0 using (2.3); thus ao = 0. 

The only quantum number characterizing the state |n), since we are look- 
ing at a fixed value of k, is the number of particles n. We shall thus identify 
ata as the number operator, i.e., the operator which counts the number of 
particles; this is the simplest choice and gives ay, = ,/n. (An irrelevant phase 
is set to one.) Notice that aat, the other diagonal operator, is not a suitable 
definiton of the number operator, since (O|aa'|0) = 1. With the identification 
of ata as the number operator, we have 


aln) = J/n |n— 1), a'|n) = V/n+1 |n+1) (2.10) 
These properties of a, at may be summarized by the commutation rules 
fa,a}=0, f[atjat}=0, {a,at}=1 (2.11) 


In fact, these commutation rules serve as the definitions of the operators 
a, a'. With the definiton of the vacuum by a/0) = 0, (0|0) = 1, we can 
recursively build up all the states. 

So far we have discussed one value of k. We can generalize the above 
discussion to all values of k by introducing a sequence of creation and anni- 
hilation operators with each pair being labeled by k. Thus we write 
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Oy tip tly; Oe J My, ltt THR, = 1) epee. ) 
Qt tk) May + +s Mey 6 Vk +1 [e, , Mkgsssente Lee? 
(2312) 
with the commutation rules 
[ax,, a1] = 0, [ai a!] = 0, (az, af] = det (2.13) 


Our discussion has so far concentrated on the abstract states, labeled by 
the momenta. It is possible to represent the above results in terms of the 
wave functions (2.1). We can actually combine the operators a,, al with the 
one-particle wave functions u,(x) and define a field operator $(x) by 


oe) = >> [ax u(x) + af, uZ(x)| (2.14) 


k 


Since uz, and uj obey the Klein-Gordon equation, we see that (x) obeys the 
Klein-Gordon equation, viz., 


(O+m?)¢(z) = 0 (2.15) 


As we noticed in Chapter 1, the wave functions actually obey the equation 


ics = /—-V?24+m? uz (2.16) 


The operator /—V? +m? is not a local operator. Since we would like to 
keep the theory as local as possible, we choose the second-order form of the 
equation. One may also wonder why we could not define a field operator 
just by the combination 5°, a,uxz or its hermitian conjugate. The reason is 
that, once we decide on the Klein-Gordon equation rather than its first order 
version (2.16), the complete set of solutions include both the positive and 
negative frequency functions, i.e., both uz(x) and uz(x). Combining these 
together as in (2.14), we can reverse the roles of (2.14) and (2.15). We can 
postulate (2.15) as the fundamental equation for ¢(x), and then the expansion 
of ¢(x) in a complete set of solutions will give us (2.14). The coefficients of 
the mode expansion, viz., ax, at are then taken as operators satisfying (2.13). 
This leads to a reconstruction of the many-particle description, but with the 
field ¢(x) as the fundamental dynamical object. Notice that the negative 
frequency solutions, which are difficult to be interpreted as wave functions in 
one-particle quantum mechanics, now naturally emerge as being associated 
with the creation operators. 

In terms of the field operator 4(x), the many-particle wave function for a 
state |nz,, Nk, ---) may be written, up to a normalization factor, as 


Ati, 0)... 0N oe (Ol parr) Plea)... Hea Mas ++) (2.17) 
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where N = nx, + nx, +... From the fact that the a,’s commute among 
themselves, we see that the wave function V(x, 22...) is symmetric under 
exchange of the positions of particles. The particles characterized by the 
commutation rules (2.13) are thus bosons. 

To recapitulate, we have seen that we can introduce creation and annihi- 
lation operators on the Hilbert space of many-particle states. They obey the 
commutation rules (2.13); the field operator ¢(x) is constructed out of these 
and obeys the Klein-Gordon equation. Conversely, one can postulate the field 
$(x) as obeying the Klein-Gordon equation; expansion of ¢(x) in a complete 
set of solutions gives (2.14). The amplitudes or coefficients of this expansion 
can then be taken as operators obeying (2.13). One can then recover the 
many-particle interpretation. 

The field operator ¢(x) is a scalar; it is hermitian and so, corresponds, 
classically to a scalar field which is real. The particles described by this field 
are bosons. 


2.3 Lagrangian and Hamiltonian 


The field operator ¢(x) obeys the equation of motion 
(+ m?)¢ =0 (2.18) 


If d(x) were not an operator but an ordinary c-number field y(z), we could 
write down a Lagrangian and an action such that the corresponding vari- 
ational equation (or extremization condition) is the Klein-Gordon equation 
(2.18). Such a Lagrangian is given by 


L= i [(0,p0"y) — my?| (2.19) 


with the action, for a spacetime volume 2’, 


S= i) d‘x L (2.20) 
23h 


The equation of motion can be derived as the condition satisfied by the fields 
which extremize the action S with fixed boundary values for the fields; i.e., 
as the condition 6S = 0. We find 


ss= | dx [—(A+m’)y] dy + $. do* (O,~)dp (21) 
a 


We consider variations with the value of y fixed on the boundary 0’ of ». 
ie., dy = 0 on OF and the extremization of the action gives the equations of 
motion 

(A+ m?)p = 0 (2.22) 


since dy is arbitrary in the interior of 2’. 


12 2 The Construction of Fields 


Notice that the Lagrangian £ is a Lorentz scalar. If we write the action 
as 


S= je dx [3 (op)? — 4{(Vy)? +m?y"}] (2.23) 


we see that it has the standard form f dt (T — U), with the kinetic energy 
T = f dx (ay)? and potential energy U = f d°x 3[(Vy)? + my]. The 
Hamiltonian is given by 


fr 72 i) dx } (doy)? + (Vo)? + m2y"] (2.24) 


If we now replace the c-number field y by the field operator ¢(x), we get a 
Hamiltonian operator 


H= / d?x 4 [(do¢)? + (Vo)? + m?¢?] (2.25) 
Use of the mode expansion (2.14) for ¢(x) gives 


teh So we al at +>) 5 Wh (2.26) 
k k 


where wy, = ky = Vk? + m2. Acting on the many-particle states, al ag is the 
number of particles of momentum k, and thus H in (2.26) gives the energy 
of the state, except for the additional term }°, 5 Wk This term is the energy 
of the vacuum state and is referred to as the zero-point energy. It arises 
because of the ambiguity of ordering of operators. The c-number expression 
(2.24) does not specify the ordering of a,’s and al’s when we replace y by 
the operator ¢. We have to drop the zero-point term in (2.26) and define the 
Hamiltonian operator as 


H= > ux alax (2.27) 
k 


to obtain agreement with the many-particle description. Actually there are 
more fundamental reasons to subtract out the zero-point term as we have 
done. This has to do with the Lorentz invariance of the vacuum, as will be 
explained later. For the moment, we may take it as part of the rule of quan- 
tization, i.e., in replacing y by the operator ¢, we must choose the ordering 
of operators such that the vacuum energy is zero. 

Analogous to the definition of the Hamiltonian, we can define a momen- 
tum operator 


P; = k; al ax (2.28) 
k 


which can be checked to give the total momentum of a many-particle state. 
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The Lagrangian has essentially all the information about the theory; it 
gives the equations of motion, operators such as the Hamiltonian and mo- 
mentum, the commutation rules, as we shall see later, and is a succinct way 
of specifying interactions, incorporating symmetries, etc. It will play a major 
role in all of what. follows. 


2.4 Functional derivatives 


A mathematical notion which is very useful to all of our discussion is that of 
the functional derivative. The action S is a functional of the field y(z), ice., 
its value depends on the specific function y(x) we use to evaluate it. More 
concretely, we may specify y(x) by an expansion in terms of a complete set 
of functions f,(z) as 


ESD TERE) (2.29) 


We can specify the function y(x) by giving the set of values {c,}. One set 
of values {c,} gives one function, a different set {c’,} will give a different 
function and so on. Thus variation of the functional form of y(z) is achieved 
by variation of the c,,’s; i.e., 


ple) + 69(2) =~ (cn + den) fale) (2.30) 


5p(z) => ben f(x) (2.31) 


A functional, i.e., a quantity that depends on the functional form of another 
quantity v(x), can be written generically as 


itv = I d's p(p,d,...) (2.32) 


For most of the applications in our discussions, we shall only need the varia- 
tions of functionals like I[y] when we change y in the interior of 2’, keeping 
the values of y on the boundary fixed. This means that we can evaluate 
the variation of I{y] by carrying out partial integrations if necessary, using 
6y = 0 on OS. The variation can then be brought to the form 


dI [yp] = [ae o(x)dy(z) (2.33) 


The functional derivative =o is then defined as o(a), the coefficient of 


dy(x). For example, 
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dp(z) = 644 — y) 


dy) 
) 
say [tty ew) = 2908) 
7) 
Foray fo? (00)? = 2(-Oela)) (2.34) 
From (2.21,2.22), we see that 
iia) = Gt mole) (2.35) 


and the equation of motion is just so =0, 

We shall now express a little more precisely the ideas of functional vari- 
ations and derivatives. v(x) is real-valued, so let us define a space which is 
the set of all real-valued functions from the spacetime region X to R, the 
real numbers. Since we shall be considering functionals like the action, which 
involve integrals of y? and (Oy)?, we require further that the functions we 
consider satisfy 


| d‘z y? < 00, | d*z (Oy)? < 00 : (2.36) 
oF » 
We may thus specify the function space F as 


F = {set of all y's such that py : 2 > R, 
with the finiteness conditions (2.36)} (2.37) 


Elements of F are functions; if desired, one can also define a mode expansion 
which furnishes a basis for F. A functional like the action is simply a map 
from F into the real numbers; i.e., it is a real-valued function on F. The 
functional derivative is thus the usual notion of derivative applied to this 
function. Of course, the function space F is infinite-dimensional, since in 
general we need an infinite number of functions f,,(x) to obtain a basis; as a 
result, one has to be careful about the convergence of sums and integrals. 

The conditions (2.36) are relevant for the problem of the scalar field. In 
different physical situations, the conditions defining a suitable function space 
may be different. Likewise, the functions may not always be real-valued. In 
any case, it is clear that one can, in a way analogous to what we have done, 
define a suitable function space and functional derivatives. 


2.5 The field operator for fermions 


The wave functions for free spin-3 particles have been given in Chapter 1 as 
the solutions of the Dirac equation. We shall now introduce the creation and 
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annihilation operators. a sien and creation operators for the particle 
are denoted by ap, and al ,, and those for the antiparticle are denoted by 
bp, and bf. (r labels the spin states.) The important difference with the 
spin-zero case is that spin-5 particles are fermions. (This is part of a general 
result, which tells us that integral values of spin correspond to bosons and 
half-odd-integral values' of spin to fermions. This “spin-statistics theorem” 
will be discussed later.) For fermions, we have the exclusion principle; there 
cannot be double occupancy of any state. Consider a fixed value of momentum 
and fixed spin state. Dropping indices for the moment, the states are |0), 


c |1) = a'|0), where c is a normalization factor and |2) = (at)?|0) = 0. 
Since there cannot be a two-particle occupancy of the state, we need (at)? = 
0, (b')? = 0, which also gives 
a? = 0, b? =0 (2.38) 
The vacuum state or the state of no-particles |0) obeys 
a|0) = b|0) = 0 (2.39) 


We can define ata as the particle number operator, as before. This leads to 
a'al0) = 0, ataj1) = |1) (2.40) 


This shows that a|1) = (1/c)|0) and the above equation, along with this, 
gives |c|? = 1 from the orthonormality of states. We also have the results 
(O|aat|0) = |c|? and (1|aa*|1) = 0. The combination aa! + ata is thus equal 
to one, on both the states |0) and |1). We shall thus use the anti-commutation 
rules 

a*=0, (at)?=0, aal + afa=1i (2.41) 


for the operators a, a', and similarly for the antiparticle operators. Notice 
that it is inconsistent to impose a rule like aat — ata = constant. The gener- 
alization of the rules (2.41) with momentum and spin labels is 
t t ae) 
ap,r Ay, 5 RE Oy s2p,r rs Pk 


barDL ae te = bre5p,k 


Ap,rAk,s + Ak,sAp,r = 0, at a}. Foie a}, i al. —=(() (2.42) 
bp,rbte,s + bk, sb, = 0, of bt, + of bt. =0 

Ap,rdk,s + bk,sAp,r = 9, Tae ye bf se == 0) 

at bis + br,2a},,, = 0, ald), +d} at, =0 


It can also be checked that, starting from these rules and defining the vacuum 
state by ap,r|0) = bp,r|0) = 0, we can recursively obtain all the multiparticle 
states of the fermions. 
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We now combine these operators with the one-particle wave functions 
to construct the fermion field operator. We can combine u,(p) e~*?? with 
@p,r. The solution v,(p) e’?* has an exponential e’”*, indicating that it must 
be interpreted as the conjugate wave function, corresponding to creation of 
particles. It must be combined with a creation operator. However, we cannot 
use af; if we do, the combination ap,,uy(p) e~'?” + a},vr(p) e'?? does not 
have definite fermion number or charge, since one term annihilates particles (a 
process with a change of —1 for fermion number) and the other term creates 
them (a process with a change of +1 for fermion number). We must thus use 
bh 5 this is consistent since annihilating particles and creating antiparticles 
change charge or fermion number by the same amount. The field operator is 


thus given by 


B(2) => fa [aoe ur(wle™* + Bf, or(D)e”™] 
PT P 


w(x) = ae om [al ar (p)e?* + bp,rdr(p)e*?* | (2.43) 


These obey the equations 


(iy-0-—m)v=0, 10," — mh =0 (2.44) 


We have used the complete set of solutions to the equations (2.44); one may 
therefore think of (2.44) as the starting point. Writing the general mode 
expansion for the fields 7 and 7, one can interpret the coefficients as operators 
obeying the anti-commutation rules (2.42) and thus recover the many-particle 
picture. 
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3 Canonical Quantization 


3.1 Lagrangian, phase space, and Poisson brackets 


In this chapter we develop the essentials of canonical quantization. Instead of 
constructing fields in terms of particle wave functions, we consider fields as 
the fundamental dynamical variables and discuss how to obtain a quantum 
theory of fields. 

We shall first consider bosonic fields. The fields will be denoted by ¢,(z). 
The index r or part of it may be a spacetime index for vector and tensor fields; 
it can also be an internal index labeling the number of independent fields. 
The Lagrangian C is a scalar function of y,(x) and its spacetime derivatives. 
We shall assume that the equations of motion are at most second order in 
the time-derivatives. Correspondingly, £ involves at most (Oy). This is 
the most relevant case. If the equations of motion involve higher-order time- 
derivatives of the fields, there are usually unphysical ghost modes (modes 
which have negative norm in the quantum theory). (There is a generalization 
of the canonical formalism for theories with higher than first-order derivatives 
in time; this is due to Ostrogradskii.) Higher powers of (Oo) also generally 
lead to difficulties in quantization and do not seem to be relevant for any 
realistic situation. We shall not discuss these situations further. 

Since the Lagrangian has at most the square of (Oo~), we expect, based 
on Lorentz invariance, that £ is at most quadratic in space-derivatives as 
well. (There are some topological Lagrangians with one time-derivative and 
several different space-derivatives of fields. We will not consider them here; 
some examples are briefly discussed in Chapter 20 which describes geometric 
quantization.) The action in a spacetime volume » can be written as 


s= | dz L(p.,d.0.) (3.1) 
poy 


The spacetime region will be taken to be of the form V x [t;, ¢,], where V is a 
spatial region. The equations of motion are given by the variational principle, 
viz., the classical trajectory y,(x, t), which connects specified initial and final 
field configurations y,(a,t;) and y,(x,ty) at times t; and ty, extremizes the 
action. In other words, we can vary the action with respect to y(x,t) for 
t; <t<ty and set 6S to zero to obtain the equations of motion. Explicitly 
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OL Oe 
ny ee — n) T ar ) 6 if 
dp” Oe 
OL (6) OL O ( OL ) 
Pe cs + eh elie 3.2 
Bp, Ba" OO cial kaa ae Bee) 


(Summation over the repeated index, in this case r, is assumed as usual.) 
When we integrate the variation of £ over the spacetime region 2’ to obtain 
5S, the second term in (3.2), being a total divergence, becomes a surface 
integral over 05’. Since we fix the initial and final field configurations y,(2, t;) 
and y,(x,ts), gr = 0 at t;, tr. Further, we assume that either dy, or BO. ¢) j 
vanishes at the spatial boundary OV. Eventually, we are interested in the 
limit of large spatial volumes; this condition is physically quite reasonable 
in this case; alternatively, we could require periodic boundary conditions for 
the spatial directions. Either way the surface integral is zero and 


0 oO 
42 |= -z— a | 8 3.3 
ss | te | a oan = 


The extremization condition 6S = 0 now yields the equations of motion, since 


dy, is arbitrary, as 
OL 0 OL 


de, Ox" (A,~er) 


We now consider more general variations of fields, with dy, not zero at 
t; or tz. The total divergence term in (3.2) integrates out to O(ts} — O(t:), 


where 
O(t =| he an OY, Se5 
( ) V O(Oov”r) ( ) 


This quantity O is called the canonical one-form. 

In the variation of the action when using the variational principle, we 
specify the initial and final values of the field configurations. Since there 
is then a unique classical trajectory, we may say that the initial and final 
values label the classical trajectories. The set of all classical trajectories is 
defined to be the phase space of the theory. Alternatively, we can specify the 
classical trajectories by the initial data for the equations of motion rather 
than initial and final values for the field. Since our equations are second 
order in time-derivatives, the initial data are clearly y,(z,t) and Ooy,(z,t), 
at some starting time t. It will be more convenient for the formalism to use 


OL 
O(Oo¢r) 


rather than Ooy,. The phase space for a set of scalar fields is thus equivalent 
to the set {7,(x), y,(a)} (for all 2) which is used to label the classical tra- 
jectories. The phase space for a field theory is obviously infinite-dimensional. 
Tr is called the canonical momentum conjugate to ¢,. 


=0 (3.4) 


T(x, t) = (3.6) 
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The canonical one-form @ can be written as 
O= / dx 15%, (3.7) 
V 


(The name is due to the fact that this is a differential one-form on the phase 
space, as will be explained in Chapter 20.) We will denote the phase space 
variables (coordinates on the phase space) by £*(a) for a general dynamical 
system, which could be more general than a scalar field theory. The canonical 
one-form © is identified from the surface term in the variation of the action 
and has the general form 


Cie | PonAi(2jx) 66a) (3.8) 


where A; could depend on €. (For the scalar field £* = (a,,y,) and A; = 
(z,,0).) Given @, we define 


Neo 6 Ealia)\ é F 
Q(z, x ) = d€*(a) A; (x ) 5€3 (x’) A;(z) 
= 0,;A;— 0 ;Ar = —2;i(a", x) (3.9) 


where in the last line, we have introduced the composite indices I = (i, z) 
and J = (j, x’) and 0; = 6/6£*(x) to avoid clutter in the notation. 2 is called 
the symplectic structure or the canonical two-form. (It can be considered as 
a differential form on the space of fields and their time-derivatives.) Just as 
the metric tensor defines the basic geometric structure for any spacetime, (2 
defines the basic geometric structure of the phase space. Notice that from the 
definition of 2, we have the Bianchi identity 


O;,2jK + O;2QK1 + OKN1z =0 (3.10) 


A concept of central importance in canonical quantization is that of a 
canonical transformation and the generator associated with it. Let €° > 
€' + a*(€) be an infinitesimal transformation of the canonical variables. This 
transformation is called canonical if it preserves the canonical structure 92. 
The change in 2 arises from two sources, firstly due to the €-dependence of 
the components 277 and secondly due to the fact that Q;, transforms under 
change of phase space coordinate frames. ({2;7 transforms as a covariant 
rank-two tensor under change of coordinates.) The total change is 


6277 = Oray = Oya 
5 nee cs 
62;;(@, x’) = FeowG = aay) (3.11) 


where 
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ay =a* Nx 


a(x) = [ee a¥ (a!) Qs (x", a) (3.12) 


(In these equations, we have expanded out the composite notation to show 
how it works out.) From (3.11, 3.12), we see that the transformation ¢* — 
€' + a*(€) will preserve 2 and hence be a canonical transformation, if 


ar = a* Qk; = —0;G (3.13) 


for some function G of the phase space variables. G so defined is called the 
generator of the canonical transformation. (Equation (3.13) is a necessary 
and sufficient condition locally on the phase space. If the phase space has 
nontrivial topology, the vanishing of 6{2 may have more general solutions. 
Even though locally all solutions look like (3.13), G may not exist globally 
on the phase space. We shall return to the case of nontrivial topology in later 
chapters.) 

If we add a total divergence 0, F to the Lagrangian, the equations of 
motion do not change, but O changes as O — 0 + 6 f dx F°. This is of 
the form (3.13) with Ay + A; + 0; f F° and hence 2 is unchanged. Thus 
the addition of total derivatives to a Lagrangian is an example of a canonical 
transformation. 

The inverse of 92 is defined by (-1)!7 Q7x = 61. which expands out as 


/ dPx!(Q-)'5 (a, 2") Qjp(w’, 2”) = 64 6a —2") (3.14) 
Vv 


As will be clear from the following discussion, it is important to have an 
invertible 277. If Q is not invertible, the Lagrangian is said to be singular. 
There are many interesting cases, e.g., theories with gauge symmetries, where 
it is not possible to define an invertible 2 in terms of the obvious field vari- 
ables. One has to define a nonsingular 2 in such cases, by suitable elimination 
of redundant degrees of freedom. (A gauge theory is an example of this; the 
redundant variables are eliminated by the procedure of gauge-fixing.) 
Using the inverse of 2, we can rewrite (3.13) with an 27! on the right- 
hand side as 
a =(0 2) ae (3.15) 


The discussion from equation (3.11) to (3.15) shows that to every infinitesimal 
canonical transformation, modulo the topological issues mentioned above, we 
can associate a function G on the phase space, and conversely, given any 
function G we can associate to it an infinitesimal canonical transformation. 

The change of any function under the transformation £! — &€! + a! is 
given by the action of the functional differential operator V, = a!0;. The 
commutator of two such transformations is given by 


[Va, Vo] = (a7 076" — b7 Aza") 0; (3.16) 
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Let F’,G be the functions associated, via (3.15), with a! and 6’, respectively. 
We then find 
at Oyh! = b7A,;a! = ea a i -_ we yee) | OLF OxG 
+(2Q7*)72(Q-1)!* (0; FOj;0«KG — 050K FOLG| 
| (3.17) 


(G2) 0; (Qa = (K a L) = (ai (O= 4 O 1 N* 
4 (On Quy ate 0;2Nn mM] 
= =(Q71)i4 (2-*)24(Q71)N¥ Oy Qn 
= (27 Ou (Q*)8* (3.18) 


where we have used the identity (3.10). This can now be used to simplify 
(3.17) as 


a” A;b' — b’ Aza? = (271)!™ Oy [(27')*OnGOz,F] (3.19) 


This shows that if a - F, b — G, the commutator of the corresponding 
infinitesimal transformations corresponds to a function —{F,G}, where 


{F,G} = (271)'/8;F4;G 


3 OF 6G 
3 bie —1lyz t 
= d : — — —- 
[os be OW ee ge RGR) 
The function {FG} is called the Poisson bracket of the functions F and G. 
It arises naturally in the composition of canonical transformations. For the 


€!°s themselves, we find {€/,€7} = (2-1)¥7 or 
{é*(w), €(x')} = (2)9 (@, 2") (3.21) 


Notice also that if O has the simple form (3.7), the Poisson bracket of two 
functions F(z, y), G(7,y) of the phase space variables becomes 


SF 6G 6F 6G 
= 3 ieee = 
ae fa = ins ae a 


Comparing equation (3.13) with the definition of Poisson brackets, we see 
that it is equivalent to 


(3.20) 


(3.22) 


al = 6¢! = {¢',G} (3.23) 


In fact, this equation may be taken as the definition of the generator. Con- 
versely, for any function G on the phase space, the transformations on et 
defined by (3.23), i-e., Poisson brackets with G, are canonical. Notice that for 
the simple case of £* = (,, Y,), (3.23) is equivalent to 


6G 6G 
Op, (x) = prey én, (x) = 5 
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More generally, the change of any function F under a canonical transforma- 
tion generated by G is given by 


6F =a'0;F = (271)'48;GO;F 
= {F,G} (3.25) 
These equations show why Poisson brackets are important. The change of 
any variable, so long as it is canonical, is given by the Poisson bracket of the 


variable with the generating function for the transformation. 
We now find the generators of some important canonical transformations. 


1. Change of y,(x). 
For (r > Yr + a,(x), Tr —? Tr, 


a= i ic an(ee) (3.26) 


2. Change of 7,(x). 
For Yr > Yr, Tr 4 Tr + a(x), 


Gas [ Praia, : (3.27) 


3. Space translations. 
For x* — x’ + a’, a’ being constants, 6y, = a*0;y,, da, = a'O;7, and 


G= / dx a'0;0,m, = a'P; (3.28) 
Vv 


Po= | Ba Oippt, : (3.29) 
Vv 
The generator of space translations, P;, is the momentum of the system. 


4. Time translations. 
The generator of time translations is the Hamiltonian H(z, vy); this is the 


definition of the Hamiltonian. From (3.24), this means that the equations of 
motion should be of the form 


6H 6H 
ofr = ne’ OoTr = ~ 80, (3.30) 
One can easily see that 
i= [ee (7,Oo9r — L) (3.31) 


The easiest way to check this is to use (3.31) to write the action as 
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ore [ee TrOoPr — fa H (3.32) 


and then use the variational principle to write the equations of motion. The 
equations of motion so obtained are seen to be (3.30), showing the consistency 
of (3.31) as the generator of time translations. 

The Hamiltonian and momentum components can be expressed in terms 
of an energy-momentum tensor T,,, defined by 


OL 


Tay = OnPrscangy 


— NuvLl + O* Boy (3.33) 
where Bayr is related to spin contributions. (We discuss this a little later in 
this chapter.) In terms of T,,,, 


P, =(H,P;) = / Bx Ti ~ (3.34) 
V 


Notice that the tensor Bg, does not contribute to the expressions for P,,. 


5. Lorentz transformations. 
For Lorentz transformations, dr“ = w'”’z,. The generator of Lorentz 
transformations can be checked to be 


Muy = | Pr lee io Sol uo) (3.35) 
V 
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As with any quantum mechanical system, the states are represented by vec- 
tors (actually rays) in a Hilbert space H. The scalar product (y|a) = Waly] 
is the wave function of the state |a) in a y-diagonal representation; it is the 
probability amplitude for finding the field configuration y(a) in the state |a). 

Observables are represented by linear hermitian operators on H. Fields 
are in general linear operators on H, not necessarily always hermitian or 
observable. We have the operator ¢,(x, t) corresponding to y,(a,t) and the 
operator 7,(xz,t) corresponding to the canonical momentum. 

The change of any operator F under any infinitesimal unitary transfor- 
mation of the Hilbert space is given by 


i 6F = FG-GF =[F,G (3.36) 


where G is the generator of the transformation; it is a hermitian operator. 
If we were to start directly with the quantum theory, we can regard this 
as the basic postulate. The fact that observables are linear hermitian oper- 
ators follow from this because observations or measurements correspond to 
infinitesimal unitary transformations of the Hilbert space. 
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However, in starting from a classical theory and quantizing it, we need a 
rule relating the operator structure to the classical phase space structure. The 
basic rule is that, in passing to the quantum theory, canonical transformations 
should be represented as unitary transformations on the Hilbert space. The 
generator of the unitary transformation is obtained by replacing the fields 
in the classical canonical generator by the corresponding operators. (This 
replacement rule has ambiguities of ordering of operators; e.g., classically, 
Trpr and YT, are the same, but the corresponding quantum versions 7,¢, 
and ¢,7, are not the same, since ¢, and 7, do not necessarily commute. The 
correct ordering for the quantum theory can sometimes be understood on 
grounds of desirable symmetries. There is no general rule.) 

Comparing the rule (3.25) for the change of a function under a canonical 
transformation with the rule (3.36) for the change of an operator under a 
unitary transformation, we see that —2|F,G] should behave as the Poisson 
bracket {F,G} in going to the classical limit. Therefore the commutator 
algebra of the operators, apart from ordering problems mentioned above, will 
be isomorphic to the Poisson bracket algebra of the corresponding classical 
functions. 

The finite version of (3.36) is 


P= & Fes S (3.37) 
The transformation law for states is given by 
la’) = ela) (3.38) 


Equations (3.37) and (3.38) say that classical canonical transformations are 
realized as unitary transformations in the quantum theory. 

Many useful results follow from (3.36) to (3.38). From the generators 
(3.26) and (3.27) of changes in y, and 7, we find, using (3.36), 


[or(x, ay ds (a t)| ma 0 
[7-(a, t), T(x’, t)] il 
[br (a, t), te(2’, t)] = ¢ 6p, OO @ — 2’) (3.39) 


These give us the basic commutation rules, sometimes called the canonical 
commutation rules, to be imposed on the operators of the theory. (More 
generally, we would have |é*(a, t), €7(x’, t)] = 1(Q-1)4 (a, w’).) 
The generator of time-translations is the Hamiltonian and we get from 
(3.36) 
OF 


is, = (FH) (3.40) 


This is the quantum equation of motion, called the Heisenberg equation of 
motion. 

Using the canonical commutation rules, one can also work out the commu- 
tator algebra of various operators of interest. For example, using expressions 
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(3.34,3.35) and replacing the fields and their canonical momenta by operators, 
we get the operators P,,, M,,, which give the action of the Poincaré trans- 
formations on any quantity in the quantum theory as in (3.36). In particular, 
using the canonical commutation rules, one can check that these operators 
obey the Poincaré algebra commutation relations given in the appendix. 


3.3 Quantization of a free scalar field 


We now apply the rules of quantization to obtain the theory of a free scalar 
field y. The Lagrangian is 


£ =} [(09)? — my" (3.41) 


In the quantum theory, the field becomes an operator ¢(a,t). The canonical 
momentum is 7(x,t) = Oo¢(x,t). The Hamiltonian is 


a ‘i da § |x? + (V¢)? + m2¢?] (3.42) 


The basic commutation rules are 


[o(z, t), o(2’, t)| = 0 
[w(x, t), w(x’, t)] il) 
[o(a, t), x(a’, t)] = 16 (2 — 2’) (3.43) 


The Heisenberg equation of motion becomes, using (3.42,3.43), 
(A+ m?)¢ =0 (3.44) 


The field operator obeys the Klein-Gordon equation. 
Since ¢ commutes with itself, it is possible to choose a ¢-diagonal repre- 


sentation where 
dlp) = 9(x)|y) (3.45) 


Here (a) is some c-number field configuration which is the eigenvalue for 
¢(x, t). In this case, we can write m(x) = —id/dp(x). This is the analog of the 
Schrédinger representation. We can in fact understand the theory by writing 
the Schrédinger equation, which would be a functional differential equation 
in this case, and solving it for the eigenstates of the Hamiltonian. However, 
the diagonalization of the Hamiltonian is most easily done in another repre- 
sentation where we solve the equation of motion (3.44). (Evidently, we are 
also using the Heisenberg picture where operators evolve with time.) The so- 
lutions are obviously plane waves. Choosing a normalization as we have done 
in Chapter 1, we can thus write the general solution to (3.44) as 


(x) = S~[axun(e) + afuz(2)] (3.46) 


k 
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where 
e tke 


me) = aa 


(GeV k? + m?.) (Notice that the u,, uj, appear here merely as mode func- 
tions for the expansion of a general solution of the equation of motion.) The 
fact that we have an ono is accounted for by considering the coefficients 
of the expansion a,, a), to be operators. Notice that since we have a real field 
classically, we need a hermitian field operator and so the coefficient of uj (x) 
in (3.46) must be the hermitian conjugate of ax. By using the orthogonality 
property of the uz(x), u(x) we have 


(3.47) 


a [es uz(xz)(we d+in), al = [ez up(x)(we @-im) (3.48) 


With these expressions, we can obtain the commutation rules for a;, al using 
the fundamental commutation rules (3.43). We find 


[ax, a1] =] 
[aj,aj] =0 (3.49) 


[an, al] = Ori 


The commutation rules for ax, ai are the same as for the creation and anni- 
hilation operators. These rules were obtained in Chapter 2 by considerations 
of the many-particle states. Here they emerge as the fundamental rules of 
quantization for the field ¢(a,t), which is the dynamical degree of freedom. 
The mode expansion for the canonical momentum 7 is obtained from the 
mode expansion (3.46) for ¢ as Oo¢. We can then evaluate the Hamiltonian 

as 
H= ye Lup (axal +alar) =>), [wealan + twa] (3.50) 

k 


Similarly, the momentum operator P; is 
P= [aon = S- 1k (a,a}, + a} ax) = dx kala, (3.51) 
k 


(We have used the commutation rules and > , ki = 0 to simplify the expres- 
sions. Strictly speaking, such expressions have to be defined by regulating 
the sum, which can be done by defining partial sums over N modes and then 
taking the limit N — oo eventually. For the momentum operator, we are 
using a reflection symmetric way of doing this, so that the contribution due 
to k is cancelled by the contribution due to —k.) 

We are now in a position to interpret these results. Apart from the con- 
stant 5wk-term, the Hamiltonian involves the positive operator ata. This is 
positive since (a|alaja) = >, (a|at|B)(Blala) = Ye \(Blala)|? > 0. This can 
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vanish only for a state obeying aja) = 0. The lowest energy state, identified 
as the vacuum state and denoted |0), can thus be defined by 


a,|0) =0 (3.52) 


We see that the vacuum state has energy equal to >>, 5 Wk. This is an (infinite) 
constant contribution to the energy and is a result of the ordering ambiguity 
mentioned earlier. The classical expression does not tell us whether we must 
use al ap or (al ax + axal). Actually the correct quantum operator should 


be wa} ax so that the vacuum has zero energy. This can be seen as follows. 
The operators P,,, M,,, obey the Poincaré algebra. In particular we have the 
relation 

(Ki, Pj] =i dij H (3.53) 


If we have a unitary realization of the Lorentz transformations and if the 
vacuum state is invariant under Lorentz transformations, so that different ob- 
servers see the vacuum in exactly the same way, we have K;|0) = 0, (0|K; = 0; 
the vacuum expectation value of (3.53) then shows that we must have 
(0|H|0) = 0. This implies that H = >, wral ar is the correct expression. 
Thus the requirement of Lorentz invariance of the vacuum can be used to 
choose the correct ordering of operators in this case. Similar arguments can 
be made for the momentum; the correct expression is P; = )°, kaha. 
(For relativistic field theory, the requirement of invariance of the vacuum is 
physically reasonable. In situations where we do not have Lorentz invariance, 
e.g., in special laboratory settings with conducting surfaces or when we do 
not have flat Minkowski space as in the neighborhood of a gravitating body, 
the vacuum energy, or more precisely, the ground state energy, is impor- 
tant and can lead to physical effects such as the Casimir effect or Hawking 
radiation.) From now on we will consider the correctly ordered expressions 
a wralag and P)— >; kal ay. 

The vacuum state has H|0) = 0, P;|0) = 0. Consider now a} |0). We have 


H at|0) =w, al|0), —P, ab |0) = &; alo) (3.54) 


This state has momentum k; and energy w, = Vk? + m?. The relationship 
between energy and momentum is what we expect for a relativistic point- 
particle of mass m, and so we can identify a} |0) as a one-particle state of 
momentum k;. Higher states can be obtained by the application of a string 
of al’s to the vacuum state. An arbitrary state 


— oy OI ig 
ee” af Tipe! \/ Nk, ! 


can be seen, by evaluation of H and P; to be a multiparticle state with 
Nk, particles of momentum k; ( and corresponding energies), Nz, particles 


\0) (3.55) 
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of momentum kp, etc. The /n,! factors are needed for normalization. One 
can also compute the angular momentum of these states and show that they 
are spin-zero particles. The states (3.55) give the full Hilbert space. In this 
version, when the states are constructed from the vacuum by the application 
of creation operators, the full Hilbert space also called a Fock space. 

The N-particle wave function for an N-body state can be defined, up to 
a normalization factor, as 


W (x1, £2, ...2n) = (0|(21)$(x2)...6(an )|N) (3.56) 
where |V) is the N-particle state as in (3.55). For one- and two-particle states, 
W(x) =up(t), Y(r1, 22) = ux, (21)uK, (2) + Un, (Z1)um(z2) (3.57) 


The two-particle wave function is symmetric under exchange of particles, due 
to the fact that a;’s commute. This shows that the particles described by the 
scalar field are bosons. 

In conclusion, through quantization of the scalar field, we have obtained 
a description of spin-zero bosons. We have recovered the many-particle the- 
ory starting from fields as the basic dynamical variables, complementing our 
construction of the field operator from the many-particle approach. 


3.4 Quantization of the Dirac field 
The Lagrangian for the Dirac field is 
L= wWliy-0—m)p (3.58) 
The momentum canonically conjugate to w is given by 
n= it . (3.59) 


One may expect that the commutation rule is of the form [(z), vt (x’)] = 
6‘) (x— 2’), but we shall see shortly that one has to use anticommutators for 
the Dirac theory. 

The Hamiltonian operator is given by 


H = f dab (i978; + mW (3.60) 


From our discussion of the plane wave solutions of the Dirac equation, we 
can write the general solution as 


w(x) on SS 4/ oe [ap,rUr(p)e ?* mi Cp,rVr(p)e?™ | 


P,r 


Be) =F bob tee + ch miele] 
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where E, = \/p* + m?. (We follow the convention of using E, for fermions, 
rather than w,.) The normalization factors in (3.61) are chosen for later sim- 
plifications. The coefficients of the plane wave expansion, viz., Age al a Cpr ale 
are operators in the quantum theory. Using this expansion and the orthonor- 


mality properties of the u and v-spinors given in Chapter 1, 


H = os E, (a), -Op,r = eleene) (3.62) 
Pr 


If we use the canonical commutation rules for y and w1, we find that 
a, at and c, cl! obey the commutation rules for the creation and annihilation 
operators. Equation (3.62) then shows the difficulty of using commutation 
rules. The Hamiltonian is not positive; there are states of negative energy. 
The way to avoid this is to use anticommutation rules. First we redefine 


a Dies Gh = bp,r (3.63) 
If we further assume the anticommutation rules bf Dk, st bi, 30h, . = brsdp,k) 


the Hamiltonian can be written as 


H=)_ E,(ah apr +bh.bpr)— >. 2Ep (3.64) 
Pp 


pr 


The change of sign for the second term is due to the anticommutation prop- 
erty. (Since the fields 7 and 7% involve sums over Gp,r, Cp,r and ae cle 
we must take ap,,, 4 to have anticommutation rules as well, to have com- 
mutation rules for the fields consistent with various physical requirements.) 
We can now define the vacuum state by a,,,-|0) = by,-|0) = 0. The vacuum 
energy (or the zero-point energy) — ae 2E, has the opposite sign to what 
we found for the scalar field. The magnitude per mode is actually the same, 
+ Ep for each of the two spin states of the positive energy solutions and for 
the two spin states of the negative energy solutions. We shall redefine the 
Hamiltonian by subtracting out the vacuum energy, for the same reasons as 
before, viz., Lorentz invariance of the vacuum. The corrected Hamiltonian 
then reads 

H=) Ep(ah apr + bb bp) (3.65) 

Pr 
With the interpretation of a,,,,bp,, as annihilation operators and the vacuum 
defined by ay,-|0) = bp,,|0) = 0, we see that H is always positive. 
The anticommutation rules can be formulated as follows. 


{y(a, t), p(x’, t} = 0, 
iia ime tt = 0 (3.66) 
{ip(a,t), r(w’,t)} =1 6% (a —a') 1 


(The spinor labels are not explicitly shown; the term 1 on the right-hand 
side refers to the identity for spinor labels. Also recall that 7 is iw’, so that 
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these anticommutation rules can be rewritten in terms of vy, wt.) We use the 
standard abbreviation AB + BA = {A,B}. Using the mode expansion for 
the fields, the commutation rules can be obtained in terms of the operators 
a, al, b, bt as 


{ap,r, Qk,o} = {ap,r, bx,s} = 0 

{ap,r,0}.} = {a},-)bk,e} = 0 (3.67) 
{ap,r; a} ,} = brs0p,k 

{bp,r, b| .} = Ors0p,k (3.68) 


The hermitian conjugates of the relations (3.67) hold as well, although we 
do not display them here. These rules are the same as what we obtained in 
the discussion of fermions and subsequent construction of the field operator 
in Chapter 2. 

The momentum operator can be evaluated as 


P; = / dx pt (-4 O:)b = >> pi [aap + Ody r] (3.69) 
P,T 
We also define a charge or fermion number operator by 
Q= i Pa vip = 5° [al ap, — bt dps] (3.70) 
Ps 


(We have chosen an ordering of operators in Q which makes it zero on the 
vacuum.) 

We can now study the states. The vacuum |0) has zero energy, momen- 
tum, and charge. The next set of states are a}, .|0) and 6}, .|0). These have 


energy E, and momentum p;. Since Ep = \/p? + m?, we see that these can 
be interpreted as one-particle states of momentum p; and mass m. The label 
r gives the spin states; these are spin-5 particles. We have seen this in terms 
of the one-particle wave functions in Chapter 1. It follows in our present dis- 
cussion by noting that (0|y(x)a},,,|0) and (O|q(«)b} ,.|0) are the one-particle 
wave functions discussed in that chapter. This result can also be checked by 
direct calculation of the angular momentum. The states a} ,.|0), bh .|0) have 
charges +1 and —1, respectively. We can thus interpret these as the states 
of a single particle and its antiparticle. Evidently, Cs is a particle creation 
operator, bi. is an antiparticle creation operator; their hermitian conjugates 
are the corresponding annihilation operators. 

Multiparticle states can be obtained by applying a string of creation op- 


ol gate = 0. Thus we cannot have more than one particle for every value of 
p,r. This is the exclusion principle. Also we see that the two-particle wave 
function given by 
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Yp,r;k,e(£1,22) = (Olyp(21 (2) |p, 75 k, 8) 


a BV = (e7 #7" e ikea — e 7 Pta eh) uy (p)tte(k) 


(3.71) 


is antisymmetric under exchange of particles. From these two results we can 
see that the Dirac field describes fermions. 

The use of the anticommutation rule was necessary to avoid states of 
negative energy. One may wonder how this ties in with the general idea 
of observables being generators of unitary transformations in the quantum 
theory since unitary transformations, infinitesimally, lead to commutation 
operations. 

Consider, for example, the Heisenberg equation of motion, 


OW 


iF = (Hl (3.72) 


The Hamiltonian is of the form Wthy , h = iy°7'd; + my®. Commutators 
of the form [A, BC] can be written out in two ways, either as [A, BJC + 
B[A, C], which is useful in evaluating commutators for a theory of bosons or as 
{ A, B}C — B{A, C}, which can be used for fermionic theories where the basic 
rules are anticommutation rules. Thus we can have commutation operations 
at the level of operators which are quadratic in (or generally even powers of) 
the basic field variables. Hermitian operators involving even powers of the 
fermionic field operators generate unitary transformations in the quantum 
theory. We have consistency if we require that all observables involve even 
powers of the fermionic field operators. With this condition, our quantization 
for fermions is consistent with the general rule of quantization (3.36). 

The spinor field must be quantized by anticommutation rules, that is, as 
fermions obeying the exclusion principle in order to have a positive Hamilto- 
nian.This is a special case of the more general spin-statistics theorem, which 
states that 


1. Quantization of half-odd integer spin fields or spinors using commutation 
rules will lead to states of negative energy. 

2. Quantization of integer spin fields using anticommutation rules will lead 
to states of negative norm, which do not, therefore, admit a probabilistic 
interpretation, or, in a different version, to lack of Lorentz covariance. 


The consequence is that spin->, -3, -3,... particles must be fermions while 
spin-zero, -1, -2, ... particles must be bosons. Originally proved for relativistic 
theories, this result has been improved over the years. Recently, there have 
been attempts to prove such a spin-statistics theorem based only on certain 
general topological arguments and the existence of antiparticles. 
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3.5 Symmetries and conservation laws 


A symmetry of a classical field theory is a transformation y — y’ under which 
the Lagrangian changes at most by a total divergence. A total divergence 
integrates to a surface term in the action and so does not change the equations 
of motion. Preserving the Lagrangian up to a total divergence is a sufficient 
but not necessary condition for a symmetry of the equations of motion. It 
is possible to have a symmetry of the equations of motion which is not a 
symmetry of the Lagrangian (even up to total divergence). The simplest 
example is nonrelativistic free particle motion described by 


Pe 1 dx‘ dz* 
= ap ‘dl 
d2x* 


We see that any general linear transformation x* — Mis’, where M is an 
invertible constant matrix is a symmetry of the equations of motion, but only 
those M’s which are orthogonal, viz., M7? M = 1 preserve the Lagrangian. 
However, in anticipation of the quantum theory, we shall be interested only in 
symmetries which are canonical transformations; such symmetries preserve 
the Lagrangian up to a total divergence. 

There are many discrete symmetries of interest in physics such as parity 
and time-reversal. We shall postpone their discussion for now and consider 
continuous symmetries. For continuous symmetries, the changes in the fields 
are specified by a continuous set of parameters. (We consider global sym- 
metries for which the parameters are constants, i.e., independent of space- 
time. Local symmetries for which the parameters can be spacetime depen- 
dent will be discussed in Chapter 10 where we introduce gauge theories.) For 
a continuous symmetry it is possible to consider infinitesimal transforma- 
tions which are very close to the identity transformation. i.e., we can write 
Yr > yl = v, + eAda,y. €4 are the infinitesimal parameters of the trans- 
formation; Par is defined by the change in y,. Being a symmetry, we must 
have 

6L— Ole 14) (3.74) 
for some K4. 

Consider a general change y, — y, + dy,. We have 


OL 0 OL Fs) ac 
1 se 5 soe 
Be Ox a) a Ont (soos) (3.75) 
We define a current associated to a symmetry transformation by 
mn OL 


= _ Le 
Ji HO,e.) oo" Kh (3.76) 
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For a symmetry, using (3.74, 3.76), we can write (3.75) as 


| OL Oo OL 


nO We ey ae A pa 
Oy, Ort Fes | Or iG OnJ (3.77) 


We see that Jp evaluated along the classical trajectories is conserved; i.e., 
OJ = 0, if the fields obey the equations of motion. Notice that (3.76) defines 
Ji, even for field configurations which do not obey the equations of motion. 

We have obtained the result that for every continuous symmetry of the 
theory, there is a current which is conserved by the time evolution of the 
fields given by the equations of motion. This result is known as Noether’s 
theorem. 

Integrating 0,,J4 = 0 over all space, we see that the charge 


Qa= / ares or (3.78) 


is preserved in time, i.e., “2 = 0. (This is true if the surface term § J,4-dS 
at the spatial boundary is zero; otherwise, this surface integral tells us the 
rate at which charge is flowing out of the volume under consideration.) 

We now consider some examples illustrating this result. 


1. Consider a complex scalar field y with the Lagrangian 
L= duy"*d"p — mp" (3.79) 


The transformation y —> y’ = e*8y is evidently a symmetry, i.e., L(y’) = 
L(y), for constant 0. Thus 


6p =i0p, dy*=-i0y*, K*=0 (3.80) 
The current is given by 
J¥ = —i[p*(O*y) — (O*y") y] (3.81) 


One can easily check that this current is conserved, using the equations of 
motion. The charge is given by 


a= -i f de (y*x* — ry) (3.82) 
ma and 7* are the canonical momenta for y and y* respectively. 0Q is the 


generator of the symmetry transformation; i.e., for any operator F built up 
of ¢, ¢* and the corresponding canonical momenta, 


i6F = [F,6Q] (3.83) 


2. Consider the Dirac theory with the Lagrangian 
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L= wiry -0—m)p (3.84) 


The transformation » > wy’ = ey, » — w' = ey is a symmetry with 
L(', b') = L(W, ). The current is given by 


J* = byt (3.85) 


The corresponding charge is 
q= [ bx vty (3.86) 


This charge is the fermion number charge we introduced in the last section 
for Dirac particles. From the anticommutation rules for the fields 


[:b(a, t), OQ] = O(a, t), [b(a, t), 0Q] ins —0 (3.87) 


showing that 6Q is the generator of the transformations 7 — e Oy, po 


ety, 


3.6 The energy-momentum tensor 


Symmetries we have considered in the examples given above leave the La- 
grangian unaltered. Generally, in the case of spacetime symmetries, the La- 
grangian changes by a total divergence; there is a nonzero K/,. The spacetime 
symmetries of interest to us are translations and Lorentz transformations 
which lead to the conservation of energy, momentum, and angular momen- 
tum. Rather than use the formula for the current, we work out once again 
the derivation of the conservation laws. 

We consider the transformation 24 — x4 + €4, with €4 = a4 + w’z,, 
which corresponds to a constant translation by a” and an infinitesimal 
Lorentz transformation with parameters w”; the vector £4 obeys 0,€4 = 0. 
The change in the fields is given by 


a 
Opr = EP O,pp — gu (Suv ¥)r 
Bur = EM Oyior + ub” Air — 31d"? (Sapp)r (3.88) 


Here S,,, are the spin matrices, whose explicit form is given in Chapter 1 and 
the appendix for spin-5 and spin-1 cases. In (3.88), they are understood to 
be in the representation to which the fields y, belong. 
The Lagrangian is a scalar function of x; the change in L is thus given 
ma OL 
OL = Oh an = OnlE*L) (3.89) 
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The Lagrangian does not have explicit dependence on the coordinates; the 
dependence on x” is through the fields and their derivatives. So we can also 
write, as in equation (3.75), 


OL 
6L = E,6 T + OL (5 ‘WesD) 3.90 
’ ¥ ? O(OL¢r) ( ) 
OL 6) OL 
é. = ——- - =— | ——— é 
Ba teas ethan’) ee 
Combining equations (3.88 - 3.91), we have 
0 OL i OL 
pai Le = See eee mae de 
Ox (¢ OnPr 56-3) e J 5 Sa TG) EO pr 
(3.92) 


We see that the quantity in the square brackets on the left-hand side is 
conserved for fields which obey the equations of motion, viz., when €, = 0. 
The current for translations is given by 
OL 

tay = OuPr F5r,) — Muv£ (3.93) 
(This is not quite the energy-momentum tensor, which is why we use the 
lowercase letter.) There is some arbitrariness in defining t,, from the conser- 
vation condition; one can add a term like 0° Byov, where Byav is antisym- 
metric in a and v, to ty,. This does not affect the conservation condition. A 
specific choice for Bar as a function of the fields and their derivatives will 
be made below, motivated by symmetry properties. 

The four-momentum of the system is given by 


IE | dr tuo (3.94) 


From equation (3.92), the current for Lorentz transformations, viz., the den- 
sity for angular momentum, is defined by 


Lu!®@Myoy = wh” [rat — $(Spa | (3.95) 


Since only the product of w“* and M is defined, there is some freedom in 
the identification of the density Mav. We now define 


i OL aL OL 
Brow = 9 (Sune) (Or) an (Sywee)s O(0a%r) "a (Seat) mi 
3.96 


Notice that B, ov is antisymmetric in a and v. The angular momentum den- 
sity, consistent with (3.95), is defined as 


Mayav == Latyy aa Lutov a (Buav eam Baw) (3.97) 
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The energy-momentum tensor is now defined as 
Tw = tv + O” Bray ’ (3.98) 


From the antisymmetry of Byov and the conservation of t,,, it follows that 
Ty» is conserved. Further, the four-momentum can be written as 


Pas if dx T.0 (3.99) 


since the term involving B gives a surface integral at spatial infinity, which 
is zero for fields which vanish there or obey appropriate periodic boundary 
conditions. We can also write the angular momentum density as 


Myav = Lol yy ae Lplov or 8? (to Busy a Daley ) (3.100) 


The angular momenta and boost generators are given by 
Me pe M ico fe (Zot o — Fete) (3.101) 


The divergence term in (3.100) leads to a surface integral at spatial infin- 
ity and does not contribute in M,. for B’s which vanish sufficiently fast. 
Therefore we may also define the angular momentum density as 


Muav al Lal = Culov (3.102) 


From the conservation of the angular momentum density, it follows that 
Ty is a symmetric tensor, when the fields obey the equations of motion. 
Our choice of Byov, which is the ambiguity in defining the currents from the 
conservation equation (3.92), is motivated by this symmetry property. The 
tensor B, av is sometimes referred to as the Belinfante tensor. In working out 
the expression for the tensor T),, from (3.98), one may encounter, depending 
on the specific theory, terms which are proportional to €,. Classically such 
terms are clearly irrelevant; quantum theoretically, one needs further physical 
requirements to define such terms; these physical requirements are related to 
the renormalization of the energy-momentum tensor. 


3.7 The electromagnetic field 


An example of a field with spin for which the above construction of the 
symmetric energy-momentum tensor can be applied is the electromagnetic 
field. The basic field variable is the electromagnetic vector potential A,,(z). 
The field strengths are given by the tensor F),, = 0,,A, — 0, A,. The electric 
field E; and the magnetic field B; are given by E; = Fo; and B; = 46:5, F%*. 
The Lagrangian is given by 
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=-iF"F, (3.103) 
This gives 
OL 
— fzRAT 
O(0, Aa) 

tuy = —O,AaFy* + 3nuF? (3.104) 

The Belinfante tensor is given by 
Buav = SA xy (3.105) 


and the symmetric energy-momentum tensor is easily seen to be 


Bo —Piak* Agr" _ (3.106) 


3.8 The energy-momentum tensor and general relativity 


We have seen that the energy-momentum tensor Ty is symmetric. For fields 
with spin, this is achieved after the addition of the Belinfante term to t,v. 
This term arises from the rotation of coordinate frames which are needed 
for defining the components of a vector or tensor, or generally for the com- 
ponents of any field with spin. General coordinate transformations, as en- 
countered in the general theory of relativity, include these frame rotations, 
and therefore we have another way to derive the energy-momentum tensor. 
We first make the action invariant under general coordinate transformations 
by including the metric tensor to carry out contractions of spacetime indices 
and to make the integration measure invariant. In this expression, the frame 
rotations are compensated by the change of the metric components. The 
energy-momentum tensor can thus be obtained by varying the action with 
respect to the metric components. Specifically, the formula is easily seen to 


be 
6S = 5 | v=3 d‘x DOG (3.107) 


In this expression, g = det(g,,) and g*” is the inverse to gyv, ie., g’” Qua = 
64. Since the metric tensor g,, is symmetric, the energy-momentum tensor so 
derived is automatically symmetric and coincides with the symmetric energy- 
momentum tensor obtained via the Belinfante addition. After identifying T,,, 
we may set gy, = Nuv to obtain the result for flat spacetime. The derivation 
of the equations of motion for gravity from an action principle involves the 
variation of the action with respect to the metric. Thus the above formula is 
also just what we expect from the fact that the energy-momentum tensor for 
matter fields acts as the source for the gravitational field. 

As an example, the action for the electromagnetic field is easily covari- 
antized as 
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oi -7 [va d4x gq’? Fy Fug (3.108) 
Using 6./—g = —i/-9 guvdg'”, we find 
OL 1 2 
Ty = —FPraky, aaa {IE 
1 
— —Fyg Fo + qk? (3.109) 


Similarly, for the scalar field we find 


1 
S= [vs d‘z 5a" .v0.v - ame (3.110) 
which leads to 
1 
Ty = On pOv'p — 5 Mu [(d¢)? — mp7] (3.111) 


These expressions coincide with those which were derived previously in a 
more tedious fashion. 


3.9 Light-cone quantization of a scalar field 


A simple example which illustrates the use of the symplectic structure 
2; (a, x’) is the light-cone quantization of a scalar field. We consider a real 
scalar field y with a Lagrangian 


L= 35(d9)? — U(y) (3.112) 
where U(y) = $m?y? + V(). 


We now introduce light-cone coordinates, corresponding to a light-cone 
in the (z, t)-direction as 


= al +t) 
ul 
i= a —t) (3.113) 


Instead of considering evolution of the fields in time t, we can consider evo- 
lution in one of the the light-cone coordinates, say, u. The other light-cone 
coordinate v and the two coordinates x7 = x,y transverse to the light-cone 
parametrize the equal-u hypersurfaces. Field configurations y(u,v,z,y) at 
fixed values of u, i.e., real-valued functions of v, x,y, characterize the trajec- 
tories. They form the phase space of the theory. The action can be written 
as 
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= / du dv Bx? [d,g0, + }(Ory)? + U(y)] (3.114) 


A naive definition of the canonical momentum 7 as Wea} gives 7 = 


—O,, which is not independent of y(v,27). Such a definition is therefore 
not very useful. However, from the variation of the action S, we can identify 
the canonical one-form O as 


O= | dv d’x™ (—dvy dy) (3.115) 
The symplectic two-form is thus given by 
Q(v, 27, v',2'7) = —2 8,6(v — v')6) (27 — x'T) (3.116) 


We need the inverse of (2. Writing 


3 
d(v—v') 6% (27 — z'T) = / i, exp (—ipu(v — v’) — ip” - (27 — 2'7)) 


(27)8 
we see that Senet 
Q(v,27 ,v',2'7) = ; oe. exp(—ip,(v — v') — ip? - (a7 — 2'?)) 
= —Felv Saeetiat — 2!) (3.118) 


Here €(v — v’) is the signature function, equal to 1 for v > 0 and equal to —1 
for v < 0. 

The phase space is thus given by field configurations y(v,z7) with the 
Poisson brackets 


{yp(u,v, 27), p(u,v’, 2’? )} = — Flv ae oe ee ) (3.119) 
The Hamiltonian for u-evolution is given by 
H= / dv d’x" [5(@ry)? + U(y)| (3.120) 
The Hamiltonian equations of motion are easily checked using the Poisson 
brackets (3.119). 


Quantization is achieved by replacing y by an operator ¢ with commuta- 
tion rules given by i-times the Poisson bracket. 


3.10 Conformal invariance of Maxwell equations 


Translations and Lorentz transformations (or Poincaré transformations) are 
not the only kind of spacetime symmetries possible. An example of another 


40 8 Canonical Quantization 


spacetime symmetry of interest is conformal symmetry. The free or sourceless 
Maxwell equations are invariant under conformal transformations. 

Lorentz transformations are of interest. because they arise as isometries of 
the Minkowski space. (We give a brief discussion of isometries and the Killing 
equation in the appendix.) The change of the metric or distance function 


ds* = 9,,dx"dz” ~~“ (3.121) 


under an infinitesimal transformation «4 — x’ + €" is given by 


og” 0g O9uy 
O9uv = Java + Gua Day + er (3.122) 
Setting this to zero, we get the Killing equation. For any given metric, the 
solutions of the Killing equation give the isometries. For the Minkowski metric 


which is constant, the Killing equation becomes 


Ouév + OE, =0 (3.123) 


The most general solution, as noted in the appendix, is given by &, = 
Ay + Wyre”; i.e., the general isometries for the Minkowski space are trans- 
lations and Lorentz transformations (including rotations). This is why these 
transformations are important for field theories on Minkowski spacetime. 

Conformal transformations preserve the metric up to a scale factor; i.e., 
the change in the metric tensor is given by Ag,, where A is a scalar. In this 
case, ds? — (1+A)ds?. The propagation of light rays is given by ds? = 0 and 
this condition is preserved by conformal transformations. For this and many 
other reasons, conformal transformations are also important in physics. From 
(3.122), we see that conformal transformations are given by 


C020 10 “Fs GepOne. 5 Oueere = Aguv : (3.124) 


This is the conformal Killing equation. For the Minkowski metric this sim- 
plifies to 


On€v + WEp = AN (3.125) 


The contraction of this with n“” gives \ = 5 Ope". The conformal Killing 
equation becomes 


Ou + Ey — 5 (Oae”) te 0 (3.126) 


The most general solution is given by 
Ey = Oy + Wyy a” + bY (a? Quy — 2yty) + € Lp (3.127) 


for constant a,, Wyv, 6”, €. The first two sets of parameters correspond to 
translations and Lorentz transformations as before. The transformations cor- 
responding to the parameters b” are called special conformal transformations 
and the transformation corresponding to « is called a dilatation. The special 


3.10 Conformal invariance of Maxwell equations 41 


conformal transformations may be understood as follows. Define an inverted 
coordinate y4 = (x"/x*), « # 0. An infinitesimal translation of y” as given by 
y* — y* +b" can be easily checked to be the special conformal transformation 
when it is expressed in terms of x“. 

Now consider the change of A,, and F),,, under a coordinate transformation 
oh =P Eb ovine 4 


OA, = €7 Fay + Ou(Ac€*) 
OF yy = €*0aF yy + FavOuE* + Fucdlé* (3.128) 


The change in the Lagrangian (3.103) is given by 
eae i 
OL = O,(LE") — af FQ |Onév + En — 30 »€) uv (3.129) 


For a conformal transformation which obeys (3.126), we see that the change 
in the Lagrangian is a total divergence. The equations of motion, which are 
the sourceless Maxwell equations, therefore are invariant. In other words, the 
sourceless Maxwell theory has conformal symmetry at the classical level. 

The conserved current for these symmetries may be obtained using (3.75). 
Since K¥ = LE, we find 


OL 
SA Or | — KP = €% [—- FY FA, — LOE] — FY O,(A- 
(a,05°"") | | (A-8) 
= €°T) — FW 0,(A-€) (3.130) 
Equation (3.75) becomes 
8, (€°T# — F#YO,(A-€)] 0 (3.131) 


where the sign = indicates that equality holds when the equations of motion 
are used. Notice that 0,[F"”0,(A -€)] = 0 by itself upon using the equations 
of motion for F“”, so that we have the conservation law 


6,[€°T#] ~ 0 (3.132) 


The conserved currents for various transformations may be obtained by sub- 
stituting (3.127) in €*T¥ and taking the coefficients of the parameters of the 
transformations. For example, the dilatation current is given by 


JY = 2°Th (3.133) 


The change 0,,(A - €) is in the form of a gauge transformation; it is not 
a change in the physical field configurations and can be dropped from these 
considerations. This is why the term F“¥”0,(A-&€) may be removed from 
the expression for the current. It also follows from (3.107) and (3.108) that 
T} = 0, since 6S = 0 for guy — Agu. 
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4 Commutators and Propagators 


4.1 Scalar field propagators 


As a prelude to the discussion of interactions we calculate commutators and 
propagators. 

Consider the theory of the scalar field first. The field ¢(x) has the mode 
expansion 


(x) =D agun(x) + aluj(z) (4.1) 
k 
where 
- etkr , 2) 
uke = av So 


(wp = Vk? + m2). The commutator [¢(z), (y)] can be directly calculated 
as 


[d(z), oy] = D_ wn(w)uj(y) — uf (x)ux(y) 
k 


Bk 1 (ike tk(a— 
Saar (c (z—y) __ pik( »)) (4.3) 


in the limit V — oo. We can rewrite this as 
d*k —4 oe 
[d(x), o(y)] = [ (nyt (a=) ee) = AGa,y) (4.4) 


where the contour is shown below. 


Fig 4.1. Contour for the commutator 
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Equation (4.4) is to be interpreted as follows. We carry out the ko-integration 
first, as a contour integral in the complex ko-plane, along the contour shown. 
The contour encloses both the poles of function (k? — m?)~!, which are at 
ko = +w,;. Equation (4.4) then reproduces the expression in (4.3). 

The commutator A(z, y) is a Lorentz-invariant function of the proper 
distance (x — y)*. If 2° = y°, the commutator vanishes since the fields com- 
mute at equal times. Since it is Lorentz invariant, it will thus vanish for all 
points which are Lorentz transforms of (x°,a), (x°,y), i.e., for all spacelike 
separations of the points x, y or equivalently for (x — y)? < 0. 


[o(x), o(y)] = 9, (x—y)’ <0 (4.5) 


¢@ is a hermitian operator and qualifies as an observable. The fact that the 
fields commute at spacelike separations tells us that it is possible to measure 
¢ at two points with no uncertainties if the two points are spacelike separated. 
This is a reflection of the fact that, as with any signals, the disturbances due 
to the measurement process cannot travel faster than light. 

Notice that if we use only the u,’s in defining a field operator, say, x(x) = 
>, GkUK(x), then we do not have [y(x) + x'(z), i(x(y) — x1 (y))] = 0 for 
spacelike separation of x,y. Thus we cannot interpret arbitrary hermitian 
combinations of x and x? as measurable quantities and be consistent with 
relativity. This is another reason that ¢(z) = )°>, axup(z) + al ut (zx) is the 
appropriate field operator in the relativistic theory. 

Green’s functions for the Klein-Gordon operator will be important for the 
discussion of interactions. We define them generically by 


(O+ m?) G(z,y) = -16 (x — y) (4.6) 
The solution can be written as 
d*k i _ik(e— : 
G(z,y) = (Qn (aa) Ease (4.7) 


The choice of contour in dealing with the singularities at kg = tw, will de- 
termine the type of Green’s function we have. The contours for the advanced 
and retarded functions are as shown below. 

Consider the retarded contour Cr. For x° > y°, we must complete the 
contour in the lower half-plane so that the exponential e~!/™*ol(#°—v°) wil] 
ae that the large semicircle will not contribute. We thus get G(z, y) = 

R\Z,Y); 


ako 
(27)3 Qu, 


=0; r<y? (4.8) 


Gr(z,y) = eo ew) ee), 2 > y9 
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Fig 4.2. Contours for retarded, advanced and Feynman propagators 


In other words, 
Gr(z,y) = (2° — y°)[9(z), o(y)] (4.9) 


where 6(x° — y°) is the step function. A similar result, Ga(z,y) = 0(y® — 
x°)A(y, x), can be obtained for the advanced Green’s function. 
Another quantity of interest is the Feynman propagator. It is defined by 


G(z,y) = (0/T o(x)d(y)|0) (4.10) 
where time-ordering, denoted by 7’, is defined by 
T9(x)G(y) = O(2° — y°)d(z)o(y) + O(y° — 2°)d(y) (x) (4.11) 


The T-symbol orders the operators on which it acts and rearranges them 
such that the operator with the latest time-argument is at the left, the one 
with the next latest time-argument comes next, and so on , with the operator 
with the earliest time-argument at the right. Using the expansion for ¢(z), 
we find 


G(,y) = >_ [0(2° — y°)ux(x)uz(y) + O(y? — 2°)un(y) uz (2)] 
k 
_f @k 1 
~ SJ (2m) Que 


[a(a° — y0)e7th(e—v) + O(y? — ie 
(4.12) 


In terms of contour integrals, we can write 


dk i ~ik(2—y) 
= o 41 
G(a,y) Qn)4 € —m?2 + =) Gy 


The contour is the real axis (and a suitable completion in the upper or lower 
half-plane). € is a small positive number, with « — 0 eventually. It shifts the 
poles as shown. The pole at ko = wx contributes for 2° > y°; the pole at 
ko = —w, contributes for y° > 2°. 

The propagator obeys the equation 
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(+ m? — ie)G(a, y) = —16 (a — y) (4.14) 


so that it is a seb function in the mathematical sense. G(x, y) is the 
inverse of 1(1]+ m? — ie). 

The propagator can be interpreted as the amplitude for particle ‘einir’ 
gation. To see this, let us define a function K(z,y) as follows. For oo ee, 
K(z,y) is the amplitude for the particle to propagaté from y to x in time 
(2° — y°). For y° > x°, since x° is the earlier time, the particle must propa- 
gate from =x to y; i.e., K(x, y) is the amplitude for this propagation. In terms 
of x-diagonal states in quantum mechanics, we can write 


K(a,y) = (x, 2°|y,y°), x? >y® 
= (y,y° |x, 2°), yo >a (4.15) 


Thus 


K (2,9) = (2° ~ y°Ywle BW )iy), + 6(y? = 2°Yagfe O27) 
(4.16) 


For the relativistic particle, H = ./p?+m?. We can insert momentum 
eigenstates |k) using the completeness relation (1.7) from Chapter 1, where 
p|k) = k|k) and (a|k) = exp(ik- ax). The eigenvalues of H are wz and we can 
evaluate the above expression to find K(x, y) = G(z, y). 

The propagator can thus be interpreted as the function which, for 2° > y° 
gives the amplitude for particle-propagation from y to z, and for y° > x° gives 
the amplitude for particle-propagation from xz to y. (As mentioned before, 
we should use both u(x) and uj(x) to define the field (x); this is the 
only combination which is physical in the sense of having [¢(x), ¢(y)] = 0 for 
spacelike separated x, y. Thus the definition of the propagator in terms of the 
field ¢(x) will include both cases, 2° > y° and x° < y°.) The interpretation 
of the propagator given here generalizes to N-point Green’s functions, as we 
shall see later. 

The analyticity of the integrand in expression (4.13) tells us that it is 
possible to deform the contour of integration for the propagator to lie along 
the imaginary ko-axis. There is no crossing of poles of the integrand in this 
deformation. We may thus write 


G(x,y) = lr 7 dko = Gn z( aa) e tay (4.17) 


Introducing ko = tka and x° = iz*, this equation can be written as 


G(z,y) = Ga(z, v)| (4.18) 


2t=—ig? y4——ijy0 


Gr(z,y) = atk _ i eik(z—v) (4.19) 
i (2r)4 \ k2 + m2 : : 
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where we integrate over the real line for all k’s including k, and k? = k-k+ 
ki, k(c—y) =k-(@—y)+ka(x* —y*). The metric used in (4.19) is thus the 
standard Euclidean one. The propagator can be considered as the analytic 
continuation of the Euclidean Green’s function G(x, y) to imaginary values 
of r+, y*. Ge(zx, y) obeys the equation 


(—Oe + m?)Gx(z,y) = 6 (x — y) (4.20) 


Thus Gz(z, y) is the inverse to the operator (—Og + m?). 

The propagator we have defined describes the probability amplitude for 
the propagation of a single particle. This is clearly a quantity of physical in- 
terest, for if the particle undergoes interactions either with an external field 
or with other particles during the course of its propagation from (y°, y) to 
(x°, x), this will affect the probability amplitude for the propagation. The 
calculation of the propagator (0/T(¢(x)@(y))|0), suitably generalized to in- 
clude interactions will then capture the effect of the interactions. Scattering 
amplitudes, for example, will be directly given by the propagator. 

A generalization of the propagator to the many-particle case can be easily 
made. The quantities of interest are the N-point functions defined by 


G(x1, £2,°++, ny) = (O/T (9(21)P(a2) --- 6(zw))|0) (4.21) 


As an example, consider the 4-point function G(x, 2, 23, 24). In the limit of 
x9, 23,23 — oo and x? — —oo, we have one-particle in the far past and three 
particles in the far future, corresponding to the process of a particle decaying 
into three others. G(z1,2%2,23,24), with these assignments of time-labels, 
gives the probability amplitude for such a process. Likewise, in the limit of 
x?,x23 — oo and 29, x9 — —oo, we have two particles in the far past and two 
particles in the far future, and the corresponding G(x, 22,23,24) gives the 
amplitude for two-particle scattering. Similarly, the N-point functions give 
amplitudes for a variety of physical processes. It is clear that the N-point 
functions are the quantities of interest. 

A succinct way to describe all the N-point functions is to collect them 


together into a generating functional Z[J] defined by 


1 
ile > 7 [tadinr dey G(a1,22,°-+,@n)J (21) J (£2) ++: J(zw) 
<= N! 


= (017 |exn( [ 76)} 10) (4.09) 


where J(x) is an arbitrary function of the spacetime coordinates x. J is not 
an operator. It is often referred to as a source function. By expanding Z[J] in 
powers of J we can easily recover all the N-point functions as the coefficients 
of the expansion. Alternatively we may write 


Gigi to oN) eames bay) Z( J] (4.23) 
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We now derive an equation obeyed by Z[J]. The basic ingredients for this 
calculation will be the operator equations of motion and the canonical equal 
time commutation rules. We start with the quantity 


5Z(J] 
5J(x) 


= wif eu)sue) --- S(yn)(0|TO(z)b(y1) b(y2) --- (yn )|0) 
N ’ 


= (0|T¢(z)eS 7*\0) (4.24) 


Consider applying 0, on this quantity. The space-derivatives go through 
the time-ordering symbol and act on ¢(z). The time-derivatives can produce 
extra terms. This can be seen as follows. From the definition of time-ordering, 
we can write 


M dy dP yJ MO? dyad yt é 
(o|To(a)e! 7#\0) = loleln- weve, a ae ayy IO 400) 


t’ 
033 1 0 33 M-1 043 
oe ayy fy sc alee dy°d*yJd 


|0) 
- (0|TeJ=0 ay a¥S6 gs nyTed on dy"dyJ$ 9) 
= (0|P(0o, z°)¢(x)P(a°, —00)|0) (4.25) 


where in the first equation we have divided the time interval into 2M in- 
tervals, labeled by t;,t;. Eventually, M — oo, with the intervals shrinking 
to zero, as in the usual definition of the integral. The second step isolates 
the x°-dependence in the appropriate order. We have also made the notation 
compact by defining 


P(z°, y°) =T exp ( / : d‘a 26) (4.26) 
This has the property 
a opts) = = f Ba J(z°,x)d(2°, a) P(z°, y°) 
seh) = —P(2',y) fda Jy? 2)6W?,2) (427) 
Taking the time-derivative of (4.25) and using (4.27), we get 
50 1LS(a)eF 0) = (01P(c0, 2°) 92D) p(2®, -c0)I0) 
+/ eee 2°) | 662) 6,0) 
- (2°, y)6(2)| P(2®, -oo}[0 (4.28) 
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The equal time commutator [4(r°, a), ¢(x°, y)] is the extra term we have; 
this is zero by the canonical commutation rules and so we may write 


sq IPOCe)e! 7*I0) = (0) (c0,2°) ES) Pla, —ooylo) (4.20) 


In a similar fashion, the second time-derivative becomes 


2 
Base OUTH(z)e! 7*I0) = (O1P(00,2°) 2) (a9, -cc)I0 


(82°) 
: / dy J(2°, y)(0|P(co, 2)[4(2°, x), 6(2°, y) 
P(2°, —co)|0) (4.30) 


The extra term [¢(z°, x), ¢(x°, y)] is —i6@)(ax — y). Using this, we find 
3° 4(z) 
(Ox)? 
~iJ(x)(0\Ted 74 |0) 


= (O|T aoe J "41 10) —iJ (2) Z[J] 


(0|To(x)ed 74\0) = (0|P(co, 2°) 


O? 
(Ar (Ar°)2 Bias —00)|0) 


(4.31) 


Using this equation we find 


(Ce +m?) 2 a 4 (O|T (Ce +m?)d(zx)ed és ~iJ(x)Z[J] (4.32) 
So far we have used the canonical commutation rules to simplify the above 
expression. The operator equation of motion (0+m?)¢ = 0 for the free scalar 
field can now be used to obtain the equation 


9, 6Z[J] 


5a) = —iJ(2)Z{J] (4.33) 


en) 
This is the equation of motion for Z[J]. All the N-point functions may be 
obtained by solving this functional equation. The solution is actually quite 
easy to write down; it is given by 


Z[J| = N exp E [ated'y HayGla,n Jw) (4.34) 


where NV is some quantity independent of J. The definition of Z[J] = 
(0|T exp(f J¢)|0) shows that we must have Z[0] = 1. This fixes the normal- 
ization factor VV to be 1. Expression (4.34) is easily verified to be a solution 
to (4.33) provided (O, +m?)G(a, y) = —i5 (x—y). A priori, there are many 
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Green’s functions G(x, y) which could be used here. However, from the solu- 
tion (4.34), and using the definition of Z[J] expanded to quadratic order in 
the J’s, we find that (0|T(¢(r)¢(y)|0) = G(z,y). This identifies G(x, y) in 
(4.34) as the Feynman propagator, or equivalently the boundary conditions 
for inverting the operator (1+ m”) have been specified. 

The N-point functions G(z1,---,2y) can be written down explicitly by 
using (4.23) and (4.34) as 


G(x, ae , Tan) = ys G(zi, ’ wis )G (ais; i, ) Pek Ce ’ Tae) (4.35) 
iP 


where the sum is over all pairings of the coordinate labels 71, X2,--:, Zan. 


4,2 Propagator for fermions 


The field operator for spin-$ particles (fermions) is given by 


va) = Ss \ nag (ene u,(p)e—*?? + bf .v,(p)e”*| (4.36) 
Pst 
Joo [at .tir(p)e* + bpd, (p)e*?* | (4.37) 
Js E,V ” 


where the creation and annihilation operators obey anticommutation rules. In 
the case of the scalar field, the time-ordered product T[¢(x)@(y)| was defined 
as ¢(x)¢(y) for x° > y® and 4(y)¢(z) for y° > 2°. We get the same expression 
as x° — y° from above or below, because the ¢’s commute at equal time. In 
the case of fermions, we have anticommutation rules at equal times, and, in 
order to get the same expression as z° > y° from either side, we must define 


reewad={ Soe eau 
For 7 and 7, we have similarly 
Tew) ={ See tne 8s (4.38) 


The propagator is defined by 
S(x,y) = (0|TY(x)b(y)|0) 


m™m Ps 
= —— } |0(2° — y°)uspiispe P2-¥) 
2 Go [2(2? — 9 )uaptep 


— O(y° — 2 Wantage] 
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ng 1 0 0 —ip(xz— 
= (say) (2? -v0-p + myewew 
— Ay? — 2°)(-y- p— m)ei(2-»)] 
= ae = ” —tp(x—y) 
(on)3 2B, [o(a y°)(y-p+mye 


— 6(y° = 2°)(y-p— mje] 


(4.39) 
(as V — 00). We have used the properties 
(y:p+m 
Y ptog = SS ipiashct 70) 
(y:p—m 
ae = = OE i) (4.40) 


The propagator (4.39) can be written as 


S(x,y) = (0|TY(x)p(y)|0) 
ap Y:PtM  _infe-y) 
(21)4 p? — m? + ie 
= (iy-0+m)G(z, y) (4.41) 


We also have (0|T(x)(y)|0) = (0|T(x)p(y)|0) = 0. 
The fermion propagator (4.41) is easily seen to obey the equation 


(iy -8—m)S(ax,y) = 16 (a — y) (4.42) 


The propagator is thus the inverse of the operator (iy-0—m). 


4.3 Grassman variables and generating functional for 
fermions 


In defining multiparticle propagators for fermions, it is again useful to collect 
them together into a generating functional. Recall that the important char- 
acteristic of fermions is that they obey the exclusion principle; there cannot 
be double occupancy of states. This is encoded in the anticommutation rules 
obeyed by fermionic fields. The definition of time-ordering for fermionic oper- 
ators, viz., (4.38) also reflects this. Appropriate source functions (the analogs 
of J) for collecting together the multiparticle propagators and the functional 
differentiation with respect to them is then provided by anticommuting c- 
number functions or Grassman variables. A Grassman number 7 has the 


property 
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n° = 0, 1 #0 (4.43) 


In other words, a Grassman number is nilpotent. One could, if desired, give 
an explicit realization of 7 as 
OP: 
= 4.44 
7 a _—. (4.44) 


where p is a real number. When there are many Grassman variables, this 
can lead to considerable notational complexity. Fortunately, for our purpose, 
such explicit realizations will not be necessary; the algebraic structure is all 
we need. We can describe a number of Grassman variables 7;, i = 1, 2,...,N 
by 

mn; + 15m = 0 (4.45) 


More generally, one can also consider Grassman-valued functions 7(z). (« may 
be thought of as a continuous version of the index 7.) For Grassman-valued 
functions, we have 


n(x)n(y) + n(y)n(z) = 0 (4.46) 


The product of two Grassman numbers (or generally an even number of such 
numbers) behaves like an ordinary c-number function or bosonic variable in 
its commutation properties. 

One can also define functions of a Grassman variable. These are defined 
by a Taylor series expansion in the Grassman variable around zero. The series 
is always finite for a finite number of Grassman variables since the square of 
a Grassman variable is zero. For example, for functions of a single Grassman 
variable we may write 


f(2,n) = fo(x) + nfi(z) (4.47) 


or if we have N variables n;, i = 1,2,..., N, 
iE 
f(z,n) = fo(x) + De fi(z)ni + rT Dy fig (@)ning +: 
a ay 


i 
ta NI ss diieean (x) ni, Tig ->-Tin (4.48) 


* diig...tn 


Notice that fj, 72... must be antisymmetric under exchange of any two indices 
since the product of the 7’s which multiplies it has this property. 

One can define differentiation of such functions by making a variation 
of 7, bringing the resulting dn; to, say, the left end by making use of the 


antisymmetry property on the indices and then defining the coefficient of 67; 
as the derivative. We find 


a 1 
a = f(x); + S> fits Nin ole geeeate (V-D)! SS F(©)éiz...in Nig Nin (4.49) 


42 12...tN 
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Differentiation has the property 


a en 
dn; On; Ong Oni 


Functional differentiation can likewise be defined by a power series expansion 
in (x) and obeys the rule 


ne Se 
On(z) On(y) ~—- An(y) An(z) 


The notion of Grassman numbers and variables may not be as intuitive 
or as natural as our notion of real and complex numbers, owing to their 
nilpotent nature. Nevertheless, the above rules for Grassman variables form 
a consistent set of algebraic rules for doing calculations and this suffices for 
the purpose we have in mind. 

We now return to fermionic fields. The propagator for fermions is given 
by S(x,y) = (0|/Tw(x)wW(y)|0), which obeys (4.42). The many-body fermion 
propagators can be discussed by introducing a generating functional 


(4.50) 


(4.51) 


Zin] = (OT exp ( / m+n) 0) (4.52) 


where we have introduced Grassman-valued functions 7, 7 in a way analogous 
to J for bosonic fields; namely, one can calculate the N-point functions for 
fermions by differentiating Z|n,7] an appropriate number of times and then 
setting 7,7 to zero. The sources 7,7 are spinors and have to be Grassman- 
valued. We take them to anticommute with 7,7 as well. The advantage of 
introducing such sources is that we can write 


Ti(x)p(x)b(y)n(y) = ni(x)d(x)d(y)n(y), o> aye 
= o(y)n(y)n(z)v(2), yo >a 
(4.53) 


Thus the combinations 7j()y(x) and w(y)n(y) behave like bosonic operators 
for time-ordering. The additional minus sign which was introduced for the 
time-ordering of fermionic operators is taken account of by the Grassman- 
valued sources. We can now write 


82 
6n(x) 


Acting on this with the operator (iy -0—m), we get 


= (0\Tu(a)el ¥+¥"/0) (4.54) 


(iy-8—m) = (OT (iy-8 — m)v(a)ed *7/0) + 


OZ 
d(x) 
in? [ a y(O(TWd(a), (aH(u) + dn(w))] ef +n) 
(4.55) 
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The first term on the right-hand side vanishes by the Dirac equation for 7. 
The commutator in the second term, which is an equal time commutator, can 
be evaluated by the canonical anticommutation rules and gives nd) (a2—y). 
The above equation then simplifies to 


(iy-0-m)—~ = in(2)Z..,-- (4.56) 
én a ) 
The solution to this equation is given by 
Zin] = exp | f dled'y n(2)S(eu)n(y) (4.57) 


where we have chosen the normalization to agree with Z{0, 0] = 1. 
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5 Interactions and the S-matrix 


5.1 A general formula for the S-matrix 


In this chapter we begin the description of interacting field theories. The case 
of the scalar field theory with an interaction of the form ¢* will be treated as 
an example. The formalism is, of course, more general and is easily extended 
to any polynomial interaction. 

The Lagrangian for the theory we are considering is thus 


£=1[(09)? — m2¢?] — ro# (5.1) 


Here is a constant; it is a measure of the strength of the interaction and is 
referred to as the coupling constant. The equation of motion is given by 


(CO, +m?) (x) + 4A4¢3(x) = 0 (5.2) 


In the quantum theory, ¢(z) is an operator on a Hilbert space and the above 
equation is an operator equation of motion. In addition to the equation of 
motion, we also have the canonical commutation rules 


[d(a°, x), (2°, y)] = 0 
[r(x°, x), (2°, y)] = —i 6) (x — y) (5.3) 
[r(x°, x), r(x°, y)] =0 


where 1(x°, x) = Oo¢(x°, x). If the interaction term 4\¢° is set to zero, then 
¢(x) is a free scalar field and one gets the standard many-particle description. 
In this case, the commutation rules and the equation of motion show that 
¢(x) can be written as (x) = D> agug(x) + al ux (x), where ax, al represent 
annihilation and creation operators for the particles. Since the equation for 
¢ is linear, the notion of what a single particle state is does not change with 
time. In the case with an interaction term, the equation of motion is nonlinear 
and we see that if we start with als then because of the nonlinear term, we can 
get at? ata3,.. terms. Thus a state a} |0) can evolve into al?|0) for example. 
This would describe the decay of a one-particle state into a three-particle 
state. The evolution of a!.aj|0) into afat|0) would describe the two-particle 
scattering with the momenta as shown. All these processes will be generically 
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referred to as scattering. The basic quantity of interest to us is the scattering 
amplitude or the transition amplitude for such a process. 

As we have discussed before, the amplitude for such processes can be 
obtained from the N-point functions 


G(a1,02,+++,2N) = (O|T(d(21)b(z2) --- $(ew))10) (5.4) 


by taking the time labels to too in a way appropriate to the process of 
interest. Therefore as a first step in calculating the scattering amplitudes, 
we shall derive an equation for such functions in the interacting case. The 
generating functional for the N-point functions was defined as 


Z[J| = (O\T exp | i de 1(2)(2)| lo) (5.5) 


In the last chapter, we also obtained the equation (4.32), 


64[J] 


Igy = OF (Ce + m?)4(a) Jel 7#) jo) 0) — iJ (a)Z[J] (5:6) 


(Oe +m? — ie) ——~ 
We have put in the ze explicitly to specify that the Green’s function to be 
used is the Feynman propagator G(z, y). In the discussion for the free case, 
the equation of motion was (1+ m?)¢(x) = 0 and this was used to simplify 
the above equation. The only difference in the interacting case is that the 
equation of motion is different. In fact, using (5.2), we get the equation for 
the generating functional in the interacting case as 


(. + m* — ie) ats 5 + (O|T [4rd%(a)el i |0) = iJ (x) Z[J] (5.7) 


Since 


6 ) ") 
5J (a1) Ga) ea = (0|T | o(@1)b(a2)d(rs)ed oe 0) (5.8) 


we may write (5.7) as 


(Ce +m? — ie) 2214 aes +4) (ss) Z[J]=—is(a)Z|J] (5.9) 


In the free case the solution to this equation was given as 
ZalJ\= exp | 5 [ dady seyGla,a) J) (5.10) 


The solution to (5.9) is then given by 


Z[J] = N exp [Hi i d*z (ais) 


ZolJ| (Sag) 
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N is a normalization factor. The above solution is easily checked as follows. 


bZ{ J] 
bJ (x) 


dZol J] 
bJ(x) 


(OC, + m? — ie) — Ne f (6/53)4 (CO. en? ie) 


= NeW J 6/83)" (4. 7(@)Ze{J}) 
7 ef C/50" (_i3() 0! f (5/63) ZU] 
5 3 
aw (sz) 2(J -iJ(a) Zl] 
(5.12) 


where we have used the identity 


oni [6/8 (_ 5 I(q)ye# S06/5" = -iJ(z) — xf 5/5J)*, J(x)| 


= -iJ(x) — 4d ey (5.13) 


The solution (5.11) can be brought to a more useful form by using the 
following identity. 


[lowe 


for functionals F,G which can be expanded in a power series. Here y¢ is also 
an arbitrary function of the spacetime coordinates; it is not the field operator. 
This identity can be checked as follows. We may write the left-hand side of 


(5.14) as 
[Z| 


i G =| Fi (5.14) 


yp=0 


FiJ] (5.15) 
Using (5.14), we can write (5.11) as 


Z\J| = Nea I C85 ei Sret+f a (5.16) 


y=0 
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where we have used obvious abbreviations like 


6 6 
[55 = [ateay G(a, y) 5y(a) 5p(y) 


This expression can be further simplified by bringing the J-term to the left 
end. We can do this by 


ot [ C55 .f Je ss ed Je fe Yeh S08 f 9) | 
=el J exp je de [ cde! 2 


a, 7 exp| / G65+ (5 if G56, if Jol 


+ alls H Gis, / ek i Jel 
oe E (i G55 + / JG6 +5 i} G4] (5.17) 


The solution (5.16) can now be written as 


Z\J) = e2 fies - "txt 
p=0 

Flg| = Nei J 655 _—* fv (5.18) 

In arriving at (5.18) we have not used any perturbative approximation. 

The form (5.18) is, however, well suited to a perturbative expansion, which 

can be obtained by expanding the exponential exp(—id [ y*) in powers of . 

The quantity exp(—iA f p*) is, of course, exp(iSint), where Sint = —A f y* 

is the interaction part of the action. We can thus generalize the above for- 
mulae to any polynomial type of interaction as follows. 


Zu)= e2 J IGI [-! aa] 


=0 
Fly] = N exp (5 / G65) exp (iSinel) (5.19) 


The normalization factor N is fixed by requiring Z[0] = 1, which is equivalent 
to 
N |. iL 7 = (5.20) 
y=0 
We now turn to the scattering amplitude. Consider first the part of Z[J] 


given by 5) Heng [y]. The contribution of this term to the N-point function 
is obtained as : 
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G(21, 22, wees ,2N) = [anata a dtzy G(x, 21)G(z2, 22) arers 


G(zn, zn) V(a, ZQy 2085 zn) (5.21) 
“) 6 6 
V (ageta, .cpeny) = 5p(21) 5:p(22) fa (5.22) 


y=0 


V (21, Z2,...,2N) is often referred to as a vertex function. 

Consider x}, £3, ...,27 — —oo and 2°,,,...,29, — oo. If the vertex func- 
tion does not extend to infinity, i.e., if it has compact support, we may then 
eee 095 29D ematand exe, ...29) Siz? |, .. eke Pheipropa 
gators G(z,z) can then be replaced by their expressions for the appropriate 
time-ordering. We then find 


G5, <= [dads os d*tny > ue (21) up, (21) Uk, (22) UZ, (22)... 
k; 

Ee (Pn41)Ukn as nite W211, 22s oN) ares) 
Along the lines of our general interpretation of the N-point functions as prob- 
ability amplitudes, this has the interpretation as the probability amplitude 
for the propagation of particles introduced at x1, X2,...,%n to be observed as 
particles at ©,41,...,2N. This is indeed the quantity of physical interest, but 
it is expressed in a basis where the particle positions are specified. For most 
scattering situations, we specify the momenta. The corresponding amplitude 
can be obtained by projecting out the momentum eigenstates by appropriate 


Fourier transformation using the orthonormality relation (1.3) from Chapter 
1. The result is 


Ste; ..-, hn — Knol, kn) = [dade ---d*2Nn UR, (21) Uke (22)... 
When rs (Zn+1)--Uiey (zu) V (21, 22, «++ ZN) 
= [tad -++d*zy Uk, (21) Uk, (Z2)-+- 


Vin +1 (2n41)+--Upy (ZN) 
) “) ) 
Sole) ble) olan) | 
(5.24) 


x 


S(ki, ke, ..-: kn — kn41,...,kn) is the amplitude for the scattering process 
with the momenta indicated. It can be thought of as the matrix element 


S(k1, ko, sey kn — Rn, seey kn) = (knea see ky|S|ku, ka, os) kn) 


of an operator S which is appropriately called the scattering operator or 
the S-operator. Equations (5.22,5.24) show that this matrix element is de- 
termined by the functional F [vy]; it is thus appropriate to refer to F [yp] as 
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the S-matrix functional. The functional differentiations in (5.24) show that 
the S-matrix element is obtained by replacing the y’s in F[y] by the one- 
particle wave functions, ux, for the incoming particles and u; for the outgoing 
particles. 

So far we have not included the effect of the terms which arise from 
exp(4 f JGJ) in Z[J]. This can lead to terms in G(a),22,---,2n) of the 
form G(x1,22)G(«3,...,£N) and similar product forms where a number of 
free propagators are multiplied by a many-particle propagator with less than 
N particles. From the factorization of the propagators in terms of ux, uj, it 
is clear that these correspond to processes where some of the particles do not 
participate in the interaction but just fly by; they are propagators discon- 
nected from the main part of the scattering process. Thus contributions from 
exp(3 f{ JGJ) describe subscattering processes and are not of great interest. 
(The nontrivial scattering contribution in such terms is taken account of at 
a lower order in J.) 

Notice that from (5.21) we may write 


V (21, 2;--; N= ‘a, + m?)G(a1,22,+++,2N) (5.25) 
9 
where we have used the relation i(0, + m?)G(z, y) = 6 (x — y). We may 
thus write the scattering amplitude as 


Tr 
S(k1, ke, 22) Bn =a Kn41) BEA = TI fv (z;)t(Q, +m?) x 
n xz 


I xc i(O,, +m?) G(21,...,2N) 


r=n+1 
(5.26) 


This result is often known as the reduction formula. (The operators i(O,, + 
m?) cancel poles in the propagators and hence, in this formula, partial inte- 
gration of the derivatives such that they act on the one-particle wave func- 
tions is not justified. Alternatively, the wave functions may be taken to be 
ee to be set to being solutions of the free wave equation only at the 
end. 
By substituting J = i(01+ m? — ie)y in equation (5.19) and doing some 
partial integrations, we see that the S-matrix functional can be written as 


Fiyj=e tS eC)? G4 m)¢) (5.27) 


i.e., it is just Z[J] ee for the choice J = i(0 + m? — ie)y, apart from 
a trivial factor of exp(—$ f e(0+ m?)y). 


5.2 Wick’s theorem 61 
5.2 Wick’s theorem 


We now want to do the perturbative expansion of the S-matrix functional. 
In doing this, we encounter terms of the form exp(f $G656) p(1)y(2) --- p(N) 
where (i) stands for y(2;). 

Consider the action-of one power of f $G66 on the fields y. 


r) 6 
ao@laeg) ° n=5\ eee” 


6 
— 


= G(1,2) (5.28) 


The operator [ $G55 replaces the pair of fields by its propagator. This is 
known as Wick contraction. When applied on y(1)p(2)--- pV) we get 


iE 3G66 p(1)e(2) - = S°G(i, fe): ep — 1)yeG+1)--- 


t<j 


9(j — 1)p(j + 1)--- p(N) (5.29) 


Notice that we again get the sum of all pairings or Wick contractions on the 
right-hand side, with one propagator only. There is no other numerical factor. 
The G(i, 7) term arises in two ways: when the first functional derivative acts 
on y(i), the second on y(j), and when the first functional derivative acts on 
y(j) and the second on y(i). This removes the factor of 5. 

Now consider the term quadratic in the G’s in the sapiasion of exp (fs G66). 
We find 


a | 3655 f $455 9(1)p(2)9(3)e(4) = G(1, 2)G(3, 4) + G(1, 3)G(2, 4) 
+G(1,4)G(2, 3) (5.30) 


We again get the sum of all Wick contractions. The pair (1, 2) can occur from 
the action of the first f3 =G66 or the second; this removes the 3; ;. Applied on 
y(1)p(2)--- y(N), we = the sum of all terms with two Wick contractions 
with no other numerical factors. Continuing in this way, we get the formula 


i 
ef 2699 91) 9(2)---o(N) = o(1)y(2)--- oN) 
+ yy terms with 1 Wick contraction each 
+ > terms with 2 Wick contractions each 


+ ‘3 terms with 3 Wick contractions each 
Sn (5.31) 
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This result is known as Wick’s theorem. It gives a simple rule to carry 
through the differentiations that we encounter in using the formula for F[y]. 
We shall refer to exp(f 5G66) as the Wick contraction operator. 


5.3 Perturbative expansion of the S-matrix 


We now consider the perturbative expansion of the S-matrix functional as 
given by (5.18). This is easily done by expansion of the exponential in powers 
of . 


1. Zeroth order in » 
We get Z[J] = Zo[J] and there is no nontrivial scattering. 
2. First order in > 


For the first order term we get 
Nea) = 8 FO55iy f oA) 
= ~in | 4 = 10) [ 9(2)?6(@,2) 
m3 H Claraeeons) (5.32) 


We shall analyze each of these terms separately. The first term leads to a 
vertex function with four points and hence to processes with four external 
particles. This can describe a decay process 1 — 3, a 2 — 2-scattering, or a 
3 — 1 process. The amplitudes are easily written down by replacing the y’s 
by the wave functions as in (5.24). For the 1 — 3 decay process, we have 


ok eee / dar un(a)ut, (x)us, (a)us, (2) 


= (4) 4) CMO (k= vn — Pa = Bs) 
(2u,V) (2up, V) (Qu, V) (2wp,V) 
(5.33) 


We can represent this diagrammatically as shown below. This diagram, known 
as a Feynman diagram, not only gives an intuitive picture of the process in- 
volved, but it is also a mnemonic for the mathematical expression for the 
scattering amplitude. We associate a factor of (1//2wV) with each external 
line; the vertex carries a factor of (—iA)4! and an energy-momentum con- 
servation 6-function (27)* O54 (kk — pi — p2 — p3). Formula (5.33) can then be 
written down by taking the product of such factors. (Since all the parti- 
cles involved have the same mass, this particular process is forbidden by 
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energy conservation. The amplitude can be written down as in (5.33), but 
the 6-function has no support and vanishes. The same rules for associating 
mathematical expressions with a diagram are general and apply to situations 
for which the amplitude does not vanish.) 
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Fig 5.1. 1 — 3 decay process 


For the 2 — 2 scattering process, we have 


_ enti atbemap) 6 34) 


S(ki,k2 — pi,p2) = (—i,)4! (Qu, V) (Qu. V)(2wp, V)(2wp, V) 


We can represent this diagrammatically as follows. This describes two-particle 
scattering to the lowest order in the coupling constant X. 


Fig 5.3. Scalar self-energy or mass correction 
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The term —i6AG(z,z)y(x)* can only lead to processes with two external 
particles. It gives a correction to the propagation of a single particle and 
contains information about corrections to the mass of the particle due to 
its self-interactions. Representing the propagator G(x, y) as a line from z to 
y, we can diagrammatically represent this term as shown in figure 5.3. The 
proper treatment of this term requires ideas about renormalization and will 
be postponed for now. 


Fig 5.4. The first order vacuum process 


The term —i3AG(zx,x)G(x, x) can be diagrammatically represented as in 
figure 5.4. It is a pure vacuum correction, i.e., changes (0|0); but, of course, 
it is canceled by our choice of normalization NV. In fact 


iN Lop i3d [ ate G(ax, x)G(z, x) + O(A7) (5.35) 


(There can be situations where the vacuum diagrams can be important 
physically. If we consider field theory in the presence of external fields, vacuum 
or ground state (which now includes the external field) can decay or undergo 
other interesting changes. These can be described by the vacuum diagrams. 
Likewise, for fields at finite temperature, the vacuum diagrams are important 
temperature-dependent corrections to the partition function.) 


2. Second order in X 


The various terms of order \? are given by 
= 1 —iX)? 
NF = 2 J G66 ws [atadty y*(x)y*(y) (5.36) 


This leads to many terms which are repetitions of first-order processes. Thus 
we get terms which can be diagrammatically represented as 
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x : ‘ ¢ N / a 
Si a ‘ ? \ ¢ Me ¢ ais 
~ ¢ \ ? \ he ie \ 
\ « Ne . 2 wee ‘ 
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Fig 5.6. Another disconnected diagram, repetition of first order processes 


Genuinely new terms arise when there is a propagator G(x, y) connecting 
points z and y. The simplest such term is 


(—ir)? 74 3 3 
a 4 | Gy)" (y) (5.37) 
Pi P 2 D3 
¥ 
A 
A, 
e rN ® 
ky ko kz 


Fig 5.7. A contribution to 3 — 3 scattering process 


This term includes many processes, such as 3 — 3 scattering, 2 — 4 
scattering, 1 — 5 decay, etc. As an example consider 3 — 3 scattering 
k1, k2,k3 — pi, p2, p3- The amplitude is given by 
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S(ki, pi) = (—iA4!)? ‘i Uk, (L)Uk, (©) UK, (2G (zx, y)us, (yup, (yup, (y) 
+ permutations 
i (21)?6@ (0, ki — 30: Pi) 
(ky + ko + ks)? —m? + ie IL, V 2wk, VV 2wpV 


+ permutations 


= (—iM4!)? 


(5.38) 


Diagrammatically this can be repesented as shown in figure 5.7. If we make a 
rule of representing each internal line or propagator in momentum space by 
(i/(k? — m? + ie)), then the mathematical expression can be written down at 
once from the diagram. The conservation of four-momentum at each vertex 
gives the correct momentum (in this case k; + kz + k3) to the propagator. 
Various other processes contained in (5.37) can be treated similarly. 

The terms with two propagators connecting x, y are contained in 


(—id)? (4 x 3)? 


a all / dixdty p(x) p*(y)G(2, y)G(a, y) (5.39) 


This can give the next order (O(A) corrections) to 2 — 2 scattering, 1 — 3 
decay, etc. For example, for the scattering k1,k2 — pi, po, we find 


—id4!)? 
oe fl Uk, (©) Uk, (2)G (a, y)G(a, yup, (y)us, (y) 


+ Uk; (x)us, (x)G(z, y)G(z, Y)Uke (y)us, (y) 
+ we, (x)up, (2)G(x, G(x, y)enes(y)up,(y)] (6-40) 


These terms are obtained by different assignment of momenta to the basic 
process represented diagrammatically by figure 5.8. 


Fig 5.8. O(A?) correction to 2 > 2 scattering 


The factor of 5 in (5.40) follows from straightforward functional differenti- 
ation, but it can also be understood as arising from the symmetry of the 
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diagram under the exchange of the two propagators. Putting in G(z,y) and 
integrating, the first of the terms in (5.40) becomes 


(—id4!)? (21)45 (ky + ke — pr — pa) 


2 (2a, V)(2wWkeaV)(2Wp, V)(2wp, V) 
 d4q i i 
SS TFT oor So Al 
«| oe (ki + ke + q)? —m?+ ie gq? — m? + ie eat) 


We find a new rule that we must integrate over the loop momentum q with the 
measure (d4q/(27)*). This concludes our discussion of the basic diagrammatic 
rules. 


It is clear that the expansion of F[y] can be carried out to any order and 
that at each order, the terms in F[y] describe a variety of processes. 


5.4 Decay rates and cross sections 


The S-matrix functional gives the amplitude for a process via (5.24). The 
transition amplitude, we have seen, has the form 


1 
S= (1 sa) (27)46(S°k-S op) M (5.42) 


where the product is over all external particles and M is an invariant matrix 
element. 


Decay Rate 


Consider the decay of a particle of momentum k into n particles of mo- 
menta p;. The S-matrix element has the form 


—: Il 1 
aa Df 2uiV 


The square of this amplitude gives the decay probability. The square of the 
6-function is ambiguous. This problem arises because we have considered 
infinite spacetime volume. We must interpret it as 


/ rei *- Dv if dtye Lok Lire — (2n)46 (Sk - Sop) ‘| dy 


= (2r)46 o> k- wr) Vr 
(5.44) 


Ss (2)*6 (k — Sp) M (5.43) 


where 7 is the range of time-integration, taken finite for now; we shall take 
7 — oo eventually. The absolute square of (5.43), keeping in mind (5.44), 
gives 
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ISP = 5a [Tap Onto e- Dop Vr IMP (5.45) 
k ; a 


The decay rate is thus given by 


LSI? eal 4 5(4) 2 

= ra (20)46(k — Sp) 1M| (5.46) 
This is the decay rate into specified sharp values of the final momenta (of the 
outgoing particles). In practice, we are interested in scattering into a small 
range d?p,,d°po,... of each final momentum around mean values pi, pa, .... 
This can be obtained by summing over all states with this range of momenta, 
viz., (Vd? p;/(27)° for each p;. Thus the decay rate into the specified ranges 
of final mometa is given by 


_ Pp 
YsOR- >” l re oe : 5.47 
The total decay rate is given by integration over all final momenta as 


ne / dr ; (5.48) 


The lifetime of the particle is given by 1/I. Notice that df and I are in- 
variant except for the factor (1/2w,). This factor gives them the correct 
Lorentz-transformation property and gives the time-dilation effect for life- 
times of fast-moving particles. 


Cross sections 


We now consider a 2 — n scattering process with momenta k,,kz2 — 
P1, P2;---;Pn- Lhe amplitude has the form 


1 


= Fey gece Ugagp On +h DM (6.49) 


Taking the absolute square using (5.44) as before, we get the rate for the 
process as 


sje 1 at 
T Que, 2wyp. V Nene gy (2m)*5 (ha a oe S~p) |M|? (5.50) 


We must divide this by the flux to obtain the cross section. The flux can be 
computed as follows. Consider the collision of two particles of masses m1, m2, 
and momenta kj, kz. If one of the particles is at rest, say, k, = (m,0,0,0), 
the flux due to the other particle is given by 
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roo 

V Wks 

since (|k2|/w,,) is the speed of the second particle and (1/V) is the density 

of the particle for the plane waves we have. We can generalize this to any 

frame as follows. Since k? = w? — ky? = m?,k3 = w? — ko” = m?, the only 

kinematic invariant is k; - ko. We have (k1 - kz)? — mim ited (Oy Res) = 

m?k2* if ky = (m,,0,0,0). Thus the generalization of (5.51), preserving the 
necessary 1 «> 2 symmetry, is 

1 JV (ki - ka)? — mim 


V Wk, Wke 


(5.51) 


ite (5.52) 
Dividing the rate (5.50) by this flux gives the cross section for scattering into 
sharp values of the final momenta. For a range of momenta of dispersion d?p; 
around the p;, we get the differential cross section 


do = |M|? (27)46 (ky + ko — Op) Il dp; 


—— 53 
4,/(k1 - k2)? — m4 wp, (2m) a 


i 
where we have set m, = m2 = ™, which is the case for us. Notice that do is 
completely Lorentz invariant. Also factors of V anf 7 have canceled out, so 
we can take the limits V, tT — oo. 


5.5 Generalization to other fields 


The generalization of the S-matrix functional to the case of many types of 
fields in interaction is quite straightforward. Notice that the formula (5.19) 
involves the propagator of free particles and the interaction part of the 
Lagrangian. Thus, once we know the propagators and the interaction La- 
grangian, we can immediately write down the expression for the S-matrix 
functional. 

As an example, consider a theory with two types of bosonic fields, denoted 
by y and €, of masses yz; and pe, respectively, and two types of fermionic 
fields denoted by P and N with masses m, and mz, respectively. The various 
propagators are then given by 


atk i —ik(e—y) 
Gyles y= Cle, With) = | coma | a Saat, | © 


d*k i ~ik(e—y) 
Go(z, y) = G(z, y, H2) = (2r)4 \ k2 — p32 + ie : 
d’p Y-P+™M _.-ip(e—y) 
(27)4 p? — mj + te 
dp y-pt+me 


= Lag fj) J ee es rea) 
So(zx, y) = S(z, Y, m2) a if (27)4 p = ma aa 


Si(x,y) = S(z,y,m1) =1 


(5.54) 
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The S-matrix functional can then be written as 


FleyEnPQ = Nexp|5 lee @ Glenn) gro] 
cot | aaa] 
x exp |= lme Sro(x,y,™1) = — aan 


0). 9 | tSinelv.é.P.N] 
x exp | ih be 2 ‘ 


(5.55) 


where we have used the same symbols y,€, P, N to denote the (operator- 
valued) fields and the corresponding c-number functions which appear in the 
expression for the S-matrix functional. 

The one-particle wave functions to be used in the calculation of the scat- 
tering amplitudes are the scalar and spin-5 wave functions given before, with 
the appropriate masses. 

Another interesting set of examples is given by the theory of nonrelativis- 
tic particles which can be considered as the theory of the Schrédinger field. 
The Lagrangian for the free theory is 


Oy 


= ire 2 vy (5.56) 


It is possible to quantize this field either using commutation rules giving the 
theory of nonrelativistic bosons or anticommutation rules giving the theory 
of nonrelativistic fermions. Consider the fermionic case as an example. The 
anticommutation rules, obtained via the canonical one-form, are 


{ 4(2°, 2), ¥(2°,y) }=0 
{yt(°, x), vi (2°, y) }=0 (5.57) 
{ p(x°, x), pl (2°, y) } a 6) (a =4) 


The one-particle wave functions are given by 
uz(z) = i e Ena +ik-x (5.58) 
JV 
where E; = k?/2m. The mode expansions for the fields are given by 
v(2) = Sr axup(z) 
k 
v(x) = J ajuz(c) (5.59) 
k 
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where 


{ax,ar} = {a}, af} = 0 
{ax,a]} = dx (5.60) 


The propagator is given' by 


; (27r)4 ko — Fy, + te 
S(z,y) is the inverse to Op — iV?/2m. 
The S-matrix functional is given by 


Fly| = Nexp |- ih ORD er Sint lb" (5.62) 


) 
dy*(y) 
where w,7)* are Grassman-valued functions now. 

The functional differentiations involved in evaluating the S-matrix func- 
tional can be carried out by using the rule of Wick’s theorem given earlier. 
Wick’s theorem for fermions works just as in the case of bosons with possible 
extra signs from moving the Grassman variables around to bring them to the 


correct order to be identified with the propagator. For example, for fermion 
fields w, w, 


W H(1)%(2) = ¥(1)h(2) + S(1, 2) (5.63) 
where we have denoted by W the Wick contraction operator 
6 ) 
exp - | aSovagey| =W (5.64) 


For the product of fields in the order 7(2)7(1), we will get the propagator 
—S§(1,2). We may think of this as writing 7(2)7(1) = —(1)(2) first, so 
that the fields are in the order in which they appear in the propagator and 
then identifying the pair with the propagator. By extension we then find 


Wb (1)b(2)h(3)h(4) = b(1)d(2)0(3)v(4) — S(2, 1) (3) H(4) 
—1)(2)b(3)S(4, 1) + b(1)¥(4)S(2, 3) 
—4(1)(2)S(A, 3) 
4S(2, 1)8(4, 3) — $(2,3)S(4, 1) (5.65) 


We see that we get all possible pairings with coefficients equal to +1 depend- 
ing on the number of transpositions of the Grassman-valued fields. This kind 
of rule extends easily to higher numbers of fields. 
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5.6 Operator formula for the N-point functions 


We now obtain a formula for the N-point functions expressed in terms of an 
operator expectation value. We start with the formulae (5.10, 5.11), which 
give 


Zo|J] = exp [5 f dtxd*y J(x)G(z, y)Jty)| 


Z(J| =Nexp -a fate (ts) | Zo|J| (5.66) 


Since Zo[J] is the generating functional for the N-point functions of a free 
scalar field, we can write, using the definition (5.5), 


Zo[J] = (O|T exp il d‘x Ha)din(2)| |0) (5.67) 
where ¢in(x) is a free field. In other words, it has the expansion 


din(t) =) axup(x) + al uz(z) (5.68) 
k 


(The operator ¢in(x) is called the “in-field” because the incoming states 
can be contructed in terms of its action on the vacuum state.) Using the 
expression (5.67) for Zo[J] in (5.66), and doing the functional derivatives 
inside the matrix element, we can write Z[.J] as 


AJ] =N(O\Tel 7 e~ J Hn 10) 
= N(0|Teds 14 eiSine(#in) 9) (5.69) 


By functionally differentiating N times with respect to J and setting J to 
zero we obtain 


G(x, 4D bane rN) = (0o|T din(£1) in (x2)...din(an ent (Fin) |Q) (5.70) 


This gives the Green’s functions of the interacting theory in terms of free field 
expectation values. Evidently a similar formula is obtained for more general 
situations and more general interaction terms in the Lagrangian. 

We can also write the scattering operator in terms of the in-fields $;,,. In 
the above formula, we have arbitrary Green’s functions expressed in terms of 
din. We now take a set of time-labels 29, 29, ...,29 to —oo and x9,4,...,29 
to +oo. In this limit, by virtue of the time-ordering, we can write (5.70) as 


G(21,%2,....tn) = (O|T Gin(tn41).--bin(en)T e®int(Pin) (ar)... (an )|0) 

(5.71) 
where we assume that the fields in the interaction term do not extend to too 
in time. Since G(x1, Z2, ..., 2) represents the scattering amplitude (albeit in 
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terms of positions rather than momenta), we see that the scattering operator 
is given by 


S =T efSint(Gin) — U (oo, —oo) 


U(a?,y°) =T exp if da? [dx Col din(2) (5.72) 
y? 


This is a very simple formula: the S-operator is given by the exponential of 
the interaction part of the action with the free field operator din, substituted 
for the field ¢. The S-operator as defined by the above formula is unitary. 
The operator U defined above obeys the properties 


UE Ug 2) va.) 
Ul(29, 9°) =U, 2°) =U (2°, y°) (5.73) 


for 29 > y® > 29. 
One can also define, at this point, the interacting field operator @ by 


$(2) = bin (a) +4 | Ga(z,y) ply) (5.74) 


where Gr(z, y) is the retarded Green’s function of (4.9) and p is defined to 
be ” 
~ 6S 
z) = —iS- 
p(x) soaaea) 


The functional derivative with respect to in is understood as follows. We 
shift @:, in S by a c-number (non-operator) function 6 f(z), i-e., din @ din + 
df. The derivative is then the coefficient of 6f in the variation of the operator. 
In other words, the derivative is defined by 


(5.75) 


Ja 6S 
5S = i = (5.76) 


Since the retarded Green’s function obeys the equation (+ m?)Gr(z, y) = 
—id 4 (x — y), we find that 


(+ m’)¢(z) = p(z) (5.77) 


Conversely, (5.74) may be thought of as the solution of the above equation 
with the retarded boundary conditions, which is appropriate if 6 — ¢in as 
x2° — —oo. In principle, we can express p in terms of ¢@ rather than ¢in 
by using (5.74). This will give an equation of motion expressed entirely in 
terms of ¢. We shall now see that this agrees with the Heisenberg equation 
of motion, so that ¢ as defined by (5.74) can be taken as the Heisenberg field 
operator itself. First of all, a solution to ¢ as defined by (5.74) is given by 
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(2) = U(—o0, 2°)din(2)U(a°, -00) (5.78) 
We can check this by rewriting it as 
(x) = din(x) + U(—00, 29) | din (2), Ue, ~o0)] (5.79) 


Since U(x°, —oo) is defined in terms of the free field ¢;,, the commutator can 
be evaluated as 


|in(z), U2, ~o0)| By rf 9 d*y U(x, y°)A(z, y) @ ae U(y°, —co) 
=i f ay Gale, Ue? 9) FAP u(?, -00) 
(5.80) 


using Gr(z,y) = O(x° — y°) A(z, y). Equation (5.79) can thus be written, by 
combining U’s using (5.73), as 


#(2) = dn(e) +4 f Gale,u)U (00,99) PMU (y?, 00) 
6 bSint(¢) 
= din(z) + if Cnt y) 56(y) (5.81) 


where we have used (5.78) again in the last step. Now, from the definition of 
U(x°, y°) in (5.72), we see that 


5S 
6Pin(Z) 
Comparison of this equation with (5.81) shows that (5.78) is indeed a solution 


to (5.74). Further, from (5.74) and (5.78) we see that p can be expressed in 
terms of the interacting field ¢ as 


bSint (din) 


p(x) = -iS* ae 


= U(—00, 2°) U(zx°, —oo) (5.82) 


5S 
ae bSint (¢) 
=a (5.83) 
Consequently, (5.77) is the Heisenberg equation of motion for a field with 
the action S = f 5[(0¢)? — m?¢?] + Sint(¢). In other words, ¢ as defined by 
(5.74) can be identified with the operator ¢ we started with in this chapter. 


Finally, notice that, since ¢j, is a free field, we can write the N-point 
functions of this field as 


p(x) = -i8-! 


5.6 Operator formula for the N-point functions 75 


(0|T din(£1)bin(X2)..-bin (4 )|0) = Coe ae lel 
J=0 
set) OH oteriten)oCen) 
y=0 
(5.84) 


where we have used (5.67) and (5.16), with the interaction term set to zero. 
Using this formula for the N-point functions of ¢;, and equation (5.19), we 
find that we can write the S-matrix functional as 


Fly] = N(O|T exp [iSint (din + )] |0) (5.85) 


References 


1. The idea of the S-matrix is due to W. Heisenberg, Zeits. fiir Phys. 120, 
513, 673 (1943). Covariant techniques for the perturbative calculation of 
the S-matrix were developed by J. Schwinger, S. Tomonaga, R.P. Feyn- 
man, F. Dyson and others. For original papers, see J. Schwinger, Selected 
Papers in Quantum Electrodynamics, Dover Publications, Inc. (1958). 

2. The use of the Heisenberg picture for the S-matrix, which is how we start 
in this chapter, is due to C.N. Yang and D. Feldman, Phys. Rev. 79, 972 
(1950). It led to the more complete LSZ formulation of H. Lehmann, K. 
Symanzik and W. Zimmerman, Nuovo Cimento, 1, 1425 (1955); ibid. 2, 
425 (1955); ibid. 6, 319 (1957). The reduction formula and the in-fields 
were introduced in this work. 

3. The operator formula for the N-point functions is due to M. Gell-Mann 
and F. Low, Phys. Rev. 84, 350 (1951). 

4. An S-matrix functional has been defined in I. Ya Aref’eva, A.A. Slavnov 
and L.D. Faddeev, Theor. Math. Phys. 21, 1165 (1974). In this context, 
see also L.D. Faddeev and A.A. Slavnov, Gauge Fields, Introduction to 
the Quantum Theory, Benjamin-Cummings, MA (1980). 


6 The Electromagnetic Field 


6.1 Quantization and photons 


The action for the electromagnetic field is given by 
d, 
ce [-yRer™ iy Bi (, 1(B2—B2)—Ap +A; (6.1) 


where 


Fup => QnA, = OVAy 
Po = E; Fi; = ijn Br (6.2) 


A, is the four-vector potential. The equations of motion for this action, ob- 
tained by varying the A,,, are the Maxwell equations 


OFM = J” (6.3) 
which can be written out in terms of the components as 


0:E; = Jo (6.4) 
OoF; + €:jk07 Br — 8p (6.5) 


Notice that, since 0,0,F4” = 0, for consistency, the current to which A,, 
couples, viz., J¥*, must be conserved. In other words 


6,J4 =0 (6.6) 


In considering the quantization of this field, we must take account of two 
related facts. Notice that if we have two vector potentials A, and Aj, = 
A,,+0,6 for some scalar function 6, the corresponding electric and magnetic 
fields are unchanged, i.e., Fy,,(A) = Fy,(A’). Thus we have a redundancy 
of variables in using A, to describe the electromagnetic field. We will need 
to eliminate the redundant degrees of freedom before we can apply the rules 
of quantization. We also notice that equation (6.4) cannot be realized as a 
Heisenberg equation of motion. Heisenberg equations of motion are of the 
form O09C = something, for an operator C. Equation (6.5) shows that the 
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equation of motion for A; is second order in time-derivatives. Thus, £; = 
0A; — 0;Ao and A; form the set of phase-space variables at a fixed time. 
Their initial values must be specified whereupon their time-evolution is given 
by (6.5). In terms of such phase-space variables, (6.4) has no time-derivatives 
and hence it cannot be obtained as a Heisenberg equation of motion. 

The elimination of the redundant variables can be done as follows. First of 
all, we can choose Ag = 0. For, if it is not zero, we use the physically equivalent 
set Al = Ag + 090, Ai = A; + 0,0 and choose @ such that 099 = — Ap to get 
Ao = 0 and some nonzero Ai, which we can rename as A;. We can then split 
Aj as 

Ay = Al + O;f . (6.7) 


where AT is ‘transverse’; i.e., it obeys the condition 0; AT = 0. This leads to 
the relation E; = 09 A? +0;(00 f). Equation (6.4) now tells us that 0;0;(O0f) = 
Jo; or 


ar / dy Gola — y)Jo(2°,y) 


0;0;Go(x — y) = 6(3) (x —y) (6.8) 
I a! 
Go(#—y) = ars a 


Gc(a — y) is the Coulomb Green’s function. B;(A) = B;(A7) since 0;f has 
vanishing curl. The A;J;-term can be simplified as 


i d‘z AjJ; = / d*a(Al J; +0;:f i) = / d*x(A? J; — f0;J;) 
A / al AT J, pon / d*x(AT J; + of Jo) 
= / d*z A? J; 
+ i dx dad?y Jo(x°, )Go(x — y)Jo(2°, y) (6.9) 


Using equations (6.7) to (6.9) in the action (6.1), we find, upon partial inte- 
grations, 


1 
= 4 
Ss = fe x 3 [@0.AF Oo.AF = 0,47 0;A7 | 


1 
+ f andy 5 Jo(x)Go(w —y)b(2® ~ ¥)Jo(u) + pee ats 
(6.10) 


We have included a 6-function in the time-variables for the term involving 
Jo’s to write the integration measure in a covariant form. We see that (6.10) 
is equivalent to the action for two massless fields corresponding to the two 


6.1 Quantization and photons 79 


transverse directions or polarizations of A?. (The combination of conditions, 
Apo = 0 and V- A = 0, which our physical fields A? obey, is called the 
radiation gauge.) In the absence of the current J“, the equation of motion 
is 

O A? =0 (6.11) 


which is the same as (6.5). The general solution to this equation can be 
written as 


A? (x) = Dy axe up (a )+ ate u (x) 


eo (6.12) 


where w, = Vk? and kz in the exponent is, as usual, koro —kj xj. el), A=? 
are unit vectors in the two independent directions transverse to k; (or 0;), so 
as to be consistent with the condition 0;A? = 0. For most purposes, we do 
not need an explicit form for these; one choice, if an explicit form is needed, 
is 


ef) = = —ki, 0) 


vere 2, 


e(2) = a kok, —(ki + k3)) (6.13) 


Vke + keVk-k 


Along with e?) = k;/Vk-k, these form an orthonormal triad of unit vectors. 


We also have heck 
(ACO) — [... 
>; €; e€; — @ | (6.14) 
dA=1,2 


This can be seen as follows. The right-hand side must be a symmetric tensor 
P;; with ki P;; = 0, k;Pi; = 0 since kje; = 0. Thus P;; must be proportional 
to (6;; —kik;/k-k). The proportionality constant is seen to be 1 by evaluating 
the trace. 

In the quantum theory, the coefficients a,), a in (6.12) are operators. 
The canonical commutation rules are obtained from (6.10) via the canonical 
one-form. Equivalently, notice that the action is essentially that of two copies 
of a scalar field (even though the transformation properties of the fields are 
different) and hence the commutation rules can be easily seen to be 


lana, axa | = 0 
lat, aba] a0 (6.15) 


lara, ake. = Sick Dan 
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Since the free theory without currents mimics two scalar fields, the Hamilto- 
nian for the free case is 
Hum >> wea} ana (6.16) 
kd 


As in the case of the scalar field, Lorentz invariance requires that we choose 
an ordering of operators such that there is no zero point energy or vacuum 
energy. 

The vacuum state obeys az,{0) = 0. One can build up many-particle 
states as we have done in detail for the scalar field. The particles in this case 
are massless (since w, = Vk-k) and come in two polarizations orthogonal to 
k. They are photons. a} , is the creation operator for a photon of wavevector 
or momentum k and polarization 4. 

The propagator can be obtained as 


Dij(a,y) = (0|T AF (2)A; (y)|0) 


d*k kik; 4 —ik(x— 
= | om (5; - =) eee (2-9) (6.17) 


This is not manifestly covariant as it stands. If we start from an interaction 
of the form A,,J”, then the interaction part of the action, after elimination 
of redundant variables, is 


1] 
ae i andy 5 Jo(2)Gc(a — u)6(2° — y)dJoly + dx ATI; (6.18) 
The S-matrix functional can be written down as 


if “) 6 iSine 
F(A) = exp 5 [Pate mB on (6.19) 


The first term which involves a photon propagator is the term quadratic in 
the currents. The quadratic terms are given by 


1 
FO) = 5 / dad*y 5 Jo(2)Gc(x — y)4(a° — y°)Jo(y) 


a : / Pad*y Ji(x)Dig(x, 9) J5(y) (6.20) 


The term involving the kjk;/k-k part of the propagator may be written as 


k; k; e~tk(z— v) i —ik(x—y) 
I(x) IG (y i. =f. 0; Ji(x le es 
ih (x) J;(y) k-k bk Raia (x) 0; Jy) EOE nes 
1 e—th(z—-y) 


= | seean i ee 
iL oJo(z)Bo Wf ez k2 + ie 


k2 e~tk(x—-y) 
[, o(7)Jo) | ek Rte 


(6.21) 


? 
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where we have used the conservation of the current as in (6.6). Noting that 
Gc(x — y) is the Fourier transform of —(1/k-k), we can combine (6.20,6.21) 
to get 


1 : 
(2) = — 4, , 2 ame 
Ee Th xd u[ote)so(w) [ Ree 


4 ; 
— ea: —ik(z—y) 
@IW | wire 


= 5 [ atody J(2)Dur(e,v)J"(w) (6.22) 


d*k 4 
Dio y) = Nav / Gnerame 


e *k(e—y) (6.23) 
Thus the propagator, when applied to conserved currents, can be taken as 
the covariant propagator given by D,,(x,y). (Even in this form, one has the 
freedom of adding a term proportional to 0,0, to nuv, since J“ is conserved. 
This will also correspond to the freedom of gauge choice; it is discussed in 
more detail in Chapter 10.) The above result is exactly what we get if we 
apply the covariant Wick contraction on a covariant interaction term, viz., 


: 5 § #P 
a5 ayy aT fad v 
F 9 in Dwar a iA) 2 pas fas (6.24) 


For all terms in the S-matrix functional involving photon propagators, a 
similar simplification can be done. (We do not discuss this in detail here 
because these issues become much simpler and clearer once the functional 
integral is introduced.) For incoming and outgoing photons, we may still 
write the covariant form of the interaction with the understanding that the 
polarization vectors vanish for the time-components and are transverse to k 
for the space components. The S-matrix functional may therefore be taken 
as 


F = exp (-3 i= Plo erp) ei f Aus" (6.25) 


6.2 Interaction with charged particles 


The coupling of the electromagnetic field to charged particles follows the 
“minimal coupling” principle. This may be formulated as follows. We have 
seen that in describing the electromagnetic field using the gauge potential A,,, 
there is redundancy of variables. Both A, and A’, = A, + 0,0 describe the 
same physical situation since the electric and magnetic fields are the same. 
The change A, — A, + 0,0 is a gauge transformation and one can say that 
the Maxwell equations are invariant with respect to gauge transformations. 
This invariance principle is the key to electromagnetic interactions. Coupling 
to charged fields must respect this invariance. 
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Consider a complex scalar field; the free Lagrangian is 
L = 0,60"¢" — md" (6.26) 


This theory has invariance under the transformation ¢ — e~*294 where 6 is 
independent of x“. In other words, L(e~*??d) = L(¢); we have already seen 
that this leads to a conserved current; Q is the charge carried by a single 
particle as given by this current. This symmetry is now made local; i.e., it 
is extended to the case where @ is considered to be a function of x. Since 
O,(e~*2%d) = e~*29(8, — 1Q0,6)¢, clearly the terms in Lagrangian (6.26) 
with derivatives of the fields are not invariant. However, using A,, we can 
form the combination 


Dud = Onb + 1QApo (6.27) 
We then have 
D,(A')¢! = e~*@? D,(A)o (6.28) 
Al, = Ay + 0,0 
db! = e104 (6.29) 


In other words, D,,¢ transforms homogeneously (or covariantly) even for lo- 
cal x-dependent 6 if we combine the phase transformation of ¢ and the gauge 
transformation of A,,. In fact, the phase transformation of ¢ can be identified 
as the gauge transformation for the matter field ¢. D,,¢ is called the covari- 
ant derivative. Since it transforms covariantly, we see that if we replace the 
derivative 0, in the Lagrangian (6.26) by D,.¢, we get a Lagrangian which 
has invariance under the gauge transformations (6.29) and has interactions 
coupling ¢, ¢* to the electromagnetic field. This is the gauge principle, which 
can be summarized as follows. 


Charged fields have Lagrangians which have invariance under con- 
stant phase transformations of the fields. This symmetry is made 
local by replacing all derivatives in the Lagrangian by the covariant 
derivatives. The resulting Lagrangian is gauge invariant and incorpo- 
rates coupling to the electromagnetic field. 


The application of this principle gives us, for the charged scalar field, 
L = DoD" d* — m?¢" 
= 0,00" o* — m7 bo" + Lint 
Lint = —iQA" (6°06 — 0,0" 6) + Q? AY Abo (6.30) 
The S-matrix functional is then given by 


ri) 
F= G 
«| oe a a ‘ 


_i | 6 6 tSint 
exp ( 5) le Dw.” 55 ey Fa) € (6.31) 


6.3 Quantum electrodynamics (QED) 83 


We have used the covariant propagator for the photon, even though the use 
of the covariant propagator was justified only for the coupling A, J" where 
J” is a conserved current. The use of the covariant propagator is actually 
correct. Rather than justify it at this stage, we shall simply assume it. The 
correctness of (6.31) will emerge naturally once we have defined functional 
integrals. ie . 

Some comments on the validity of the gauge principle are appropriate 
here. It is valid for all fundamental charged particles. Indeed, it is the defin- 
ing principle for electromagnetic interactions. (And a suitable generalization 
gives other interactions as well.) It also applies to composite charged par- 
ticles for the coupling of long-wavelength modes of A, viz., those modes 
whose wavelengths are large compared to the size of the composite particle. 
For shorter-wavelength A,-modes coupling to composite particles, the effec- 
tive interaction has to be derived from the details of the composite nature. 
An example is the neutron, which has overall charge neutrality but does have 
a magnetic moment. It has a coupling to the electromagnetic field given by 
cN Yu Vv N F#”, where N is the neutron field and the constant c is to be deter- 
mined from the fact that the neutron is a bound state of three fundamental 
quarks. Clearly this is not of the form of a minimal coupling. Another exam- 
ple is the neutral 7-meson which decays electromagnetically to two photons. 
The interaction Lagrangian is effectively 


2 
eC 
Lint = —5— E-B 6.32 
ant An2 = p ( ) 
where f,, is the so-called pion decay constant and ¢ represents the pion field. 
This is again not of the minimal type. It can be derived from the fact that the 
pion is made up of a quark and an antiquark which couple minimally to the 
electromagnetic field. This will be derived later using anomaly considerations. 


6.3 Quantum electrodynamics (QED) 


Quantum electrodynamics usually refers to the theory of electrons, positrons, 
and the electromagnetic field in interaction. Electrons and positrons can be 
described by a Dirac spinor field with the Lagrangian 


L=w(iy:-d-—m)p (6.33) 


The Lagrangian obviously has invariance under the constant phase transfor- 
mation w — e’°9y. The covariant derivative can be written as 


Dyth = O,b — ieAyb (6.34) 


This corresponds to Q = —e, which is appropriate for interpreting ~ as 
corresponding to the annihilation of electrons of charge —e and creation of 
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positrons of charge e. The gauge principle now gives for the interacting theory 


L=apliy D — mp 
= P(iy-O-m)p + ebypAy 
Lint = epypAn eee x. (6.35) 


Notice that the interaction part of the Lagrangian has the form of A’J,,. 
The S-matrix functional for quantum electrodynamics (QED) may thus be 
written as 


tf Daw sa) x 
Fao (—5 | Pu aa TAL 
6 6 ie f Audy'v 
—- | ——~ = 7 6.36 
= | aeS ORR] eat) 
One can also write down the operator formula for the S-matrix for QED as 


S$=T exp (ie i Au indivi (6.37) 


where the in-field A, in has the mode expansion given by (6.12) and propa- 
gator given by (6.23). The fermion fields have the expansions 


Wert) = S> = [apr Ur(p)e*?* + bE up(p)e*?* | (6.38) 
pr 


Win (2) = a \ == [at .tir(p)e'?* - ber U (pen | (6.39) 
p,r ss 


and the propagator given by (4.41) as 


_f ap ye ptm ie 
SODA") Oa Fom +e oe 
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7 Examples of Scattering Processes 


7.1 Photon-scalar charged particle scattering 


Compton scattering, historically, refers to photon-electron scattering. We con- 
sider here the same type of process, except that we have scalar charged par- 
ticles instead of electrons. This may be referred to as scalar Compton scat- 
tering. The S-matrix functional for charged particles coupled to photons is 
given by 


F = exp| C(wmms anaes ; |» 


1,232 6y(21) dy* (x2) 
1 —_— _ 
exp {| —— Duv(x, y)=——- = dS 
p( 2 fs ole) 6A, (x) 6AL(y) 
Lint = —ieA” (p* Ou — Ouy™ Y) + e? AH A, pp* (7.1) 


To the lowest nontrivial order, viz., to second order in e, the relevant term 
in F is given by 


Fae f area (o@)6,Glnd.w) +iet [Pee (72) 


where f aug = f0,g — 0, f g. Denoting the incoming and outgoing photon 
momenta by k and k’ respectively, and by p and p’ for the charged particle, 
the amplitude, as given by the general formula (5.24) is seen to be 


(2276 (p+ k—p' —F') 
V2. /2w dw’ 2Ep2 Ey 


ian (2p! — bk) eO)(k)(2p — k’) -eOD(k’) 
M = ie? 22 eS RD (pk P= mi tie 


A= M 


_ 2p! +k’) cP (k’)(Qp + k) -e™ (k) 


(p +k)? —m? + ie ) 


Since p?—m? = 0,k? = 0, etc., we have (p—k!)?—m? = p?*—m?+k!? —2p-k! = 
—2p-k’, etc. Further since k - e)(k) = 0, k’ - e(k’) = 0, we can simplify 
the above expression to 
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— Dine ter eee 
M = 2ie* le eo? + el Tee 


(7.4) 


where we use e), e’) for e*(k), e* (k’). The cross section is especially simple 
to evaluate in the rest frame of the incoming charged particle. In this case, 
p = (m,0,0,0) and hence p-e) = p-e) = 0 since the polarizations do not 
have time-components. We then find for the differential rate for scattering 
into a range of final momenta 


|A? ae (27)45() (p +k — p! — k’) (2) 0)? dp’ dk! 
T 16ww! Ey Ey V (27)3 (27)3 


(7.5) 


The flux is given by F = ,/(p-k)?/VE,w = 1/V. Thus the differential cross 
section is 


e4 (2n)*6O(ptk—p—k) ay oye Pp Pk 
7 f ww'mE! (eo ae (27)3 (27) - 


Here E’ = E,y. The p’-integration is trivially done, identifying p’ as p’ = 
p+k—k'=k—k’. Also define the scattering angle 6 by k- k’ = ww’ cos@. 
Then 


An wmeE’ 


The remaining integral can also be done easily. The argument of the 6- 


function depends on w’ directly and through E’ = \/p’? + m? = \/(k — k’)? + m? 
Vw? + w!? — 2Qww’ cos? + m2. Thus, 
dw’ +E’) E’+w'—weosé 
du! 7 E’ 
Carrying out the w’-integration, we then get 


6S —— —E—E—EE———E— : 
mw (E’ + w! — wcos6) uy (ea) 


2\ 2 Sire )ee ae 
ae (=) es eee «| ee 
(7.8) 


where a = e”/47 is the fine structure constant and w’ is given by m+w = 
E’+w!. We may write this as (m+w—w’)? = EB? = w?+w!*—2ww! cos0+m? 
or 


= a 7.10 
w m+w(1—cos@) Gly) 
From this result, we also find that E’ + w’ —wcos@ = m(w/w’). Thus 
a? ta 2 SK , 
do = a (=) (e . e@)24a0 (7.11) 


This is the cross section for specific polarizations of the incoming and outgo- 
ing photons. If we are considering an initially unpolarized beam of photons, 
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we should average over the initial polarizations. Likewise, if the final polar- 
izations are not measured separately, but only the cross section for all final 
polarizations is considered, we must sum over all final polarizations. In this 


case 
do 1 da 
col. y 2 2 2 ey 


a aN 
2 1\ 2 
=35 (=) (1 + cos? 4) (7.12) 


where we have used the result 
kjk; 
Dare = (ay - 4) (7.13) 
X 


and a similar result for the final polarizations. 
For the total cross section, we integrate over all angles to get 


nro? [ 4 40 21+2) 

ee; | Raa Ie reine it. sae Bf D) : 

; i4oe ' @ 73 log(1 + 2) (7.14) 
where z = w/m. For small x, we can expand the logarithm to obtain 


= 87a? 
3m2 


(7.15) 


which agrees with the classical Thomson scattering cross section. For w >> m 


or large z, 
27a? 


(7.16) 


=x 
~~ 


xm 


7.2 Electron scattering in an external Coulomb field 


This is also known as Mott scattering. The interaction Lagrangian for elec- 
trons in an external electromagnetic field is given by 


Lint = ery pay” (7.17) 
The S-matrix functional is given by 
6 6 te D Aett bybap 
F=e - | aes x, y)——~ |] e M (7.18) 
lf aay agg) 
For one incoming electron of momentum p and spin state a and one outgoing 


electron of momentum p’ and spin state 6, the amplitude is given, to lowest 
order in the coupling e, as 
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m #8 . , 
A= =a if dx Ae(z)e7?-P')2 (7.19) 
ap | 
Consider an external screened Coulomb potential 
Ay = Ze - aor. A= (7.20) 
r ~ 


We consider the screened potential to avoid difficulties of integration in (7.19). 
At the end, we can take the limit a — 0. Using 


AnZ 
[oecte= v——_e 7 = aya = (7.21) 
we find, in the limit a — 0 
AnZ 
A= ite “az Han) tap2n5(E — E’) (7.22) 


where gq = p—p’. Squaring and using the trick of replacing one factor of the 

6-function by the total time 7, we get 
|Al? _ (4xZe?)?_ m? 

—— qt E2v2 


\Ugpy°Uap|?275(E — E’) (7.23) 


Since the source of the potential is at rest, the flux is given by F = |p|/EV. 
We then get for the cross section 


_|AP BV, dip! 
7 ipl “@me 
A(Ze2)2m2 
— one [tap tap|?0(E ad E’)p’*dp'dQ (7.24) 


We must have |p| = |p’| because of the 6(#— E’). Further E’dE’ = p'dp’ from 


E’ = \/p’? + m2. Keeping these in mind, the integration over the magnitude 
of p’ can be done trivially because of the 6-function and gives 


da 4(Ze?)?m 2 GP 12 
37 
= (Ze*)?m? ah 0, a2 
- dpi*sint(0/2) ied — 


where we have used q? = p-p+p’-p'—2p-p! = 2p-p(1—cos 0) = 4|p|? sin? (0/2). 
The angle @ between the incoming direction and the outgoing direction for 
the electron is called the scattering angle. 

The above result is for specific choices of initial spin (polarized incoming 
beam) and specific spin values for the scattered electrons. If we have an 
unpolarized beam and if there is no experimental discrimination of final spin 
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values, we must average over initial spins and sum over all final spins. This 
involves the quantity 


3 $y Sy lap nasa 2 =a De Uap) UaptiapY Upp! 
a=1,2 B=1,2 a=1,2 §=1,2 
= 5 5. Tr [Uaptiapy Upp tapy”] 
a,B 
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a hd [4m? + 4p,yp),( — | 
= aa PpPr\N.0%v0 — Nv Noo + NvoNu0 ) 
i 
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5 [E? +m? +p-p'] 
F?2 
os [1 — v? sin?(6/2)] (7.26) 


where v = |p|/E is the velocity of the incoming electron. Using this calcula- 
tion, we find for the unpolarized cross section 


do _ (Ze7)2 


dt = Tytan6/D (1 — v? sin?(6/2)) (7.27) 


For v < 1 this gives the classical Rutherford scattering cross section. The 
term [1 — v* sin?(@/2)] arises because of the spin of the electron. (In the rest 
frame of the electron, the source is moving. The electron thus feels a magnetic 
field with which the spin can interact.) 

The cross section diverges at 0 = 0. This is related to the infinite range 
of the Coulomb potential as seen from (7.21). This divergence can be consis- 
tently eliminated by including the finite angular resolution of the detector. 


7.3 Slow neutron scattering from a medium 


Scattering of neutrons from crystals and liquids is a useful technique for 
studying the properties of such materials. This is a problem which can be 
discussed using standard many-particle quantum mechanics, but it is inter- 
esting to apply field theory to it. The interaction between the neutron and 
the nuclei can be represented by 


Lint = ANNPP (7.28) 


where P represents the nucleus which may be bound in a crystal, for example. 
The S-matrix element is obtained by replacing the c-number fields in the S- 
matrix functional by the wave functions of the particles. So far we have used 
free wave functions for the incoming and outgoing particles. But in the present 
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case, the wave function for the nucleus is not free; in the case of scattering 
from a crystal, the nucleus or the positive ion is bound to the crystal and 
the wave function must be the corresponding one. Denoting this by Y(x), we 
find for the amplitude, to lowest order in A, 


as e922 (2)Ui(2)... .. (7.29) 


where q = k—k’, k,k’ are the incoming and outgoing momenta of the neutron 
and the subscripts i, f on the W’s refer to the initial and final states for the 
nucleus. We have used nonrelativistic wave functions for the neutron. 

Consider first the case of a crystal lattice. In this case, the wave function 
of the nucleus corresponds to an oscillating ion at a lattice site. Denoting the 
position of the lattice point by a, we have 


U(x) = eB” h(a — an) (7.30) 
E is the energy of the state of the crystal. Using this, we can write 
/ d°x eft DF (x)V; (2) 
a lis [es wv (x ma a,)yi(x ad Gn )et —o>) eee 
aa ei(Es—E:)2° (flettén |i)e*t' an (7.31) 
€,, is the position operator for the particle (ion) at the lattice site a,. Since 


the neutron can scatter from any one of the ions at the various lattice points, 
we have, for the total amplitude 


ir : ; 
A = 7, 2n5(qo + Ei — Ey) S> (flett§ |ieit an (7.32) 


The incoming flux is k/mV, and so the cross section becomes 


2m , , ; d°k! 
= S alo en t fe i\ ptd'\Qn—Am 
do =" 2x8 + B,— Ep fiom p) plete elaermon) EF 
(7.33) 


The final state of the crystal is seldom discriminated in any scattering, so the 
quantity of interest is the cross section for scattering into all final states of 
the crystal; we should sum over all final states. We have 


y 215(qo + Ei — Es) (ile | f) (fle €> i) 


a, | dae ‘r+ Pea Epanijen "© Selif) iilele Selah 
d 
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= f dederitoe? SY (ijemta en] p) (jetta ert 
f 
= | dePertm# (je tb (gta6al= ji (7.34) 


where we have used the' completeness relation \~ slF)(F| = 1 and €,,(2°) = 


etHz® Ce 2° denotes the Heisenberg time-dependent version of the position 
operator at the indicated site. Finally, the initial state is seldom prepared to 
be in any fixed pure state. Rather, one has thermal excitations of the lattice 
vibrations. This can be taken care of by summing over all initial states with 
a probability c; for state |:) or equivalently by taking the average with a 
density matrix p = }/, c;|t)(i|. Using (7.34) and averaging, we get 


A2m oon ot : , 0 . d>k! 
yea —tqoz —iq-€,,(0) ,i¢-€,,(z )| ,iq:(@n—@m 
d= ; [are in DT [pe a ef TEn( je ie ’ Gap 


(7.35) 
The summation over the lattice sites with the factor e’4'(¢—™) leads to sharp 
peaks at values of the momentum transfer corresponding to the reciprocal 
lattice vectors. At these values of momentum transfer, the factors e’7 (¢»—9) 
are all equal to 1 and we have complete constructive interference. The factor 
Tr [pe Em eiaEn(")| suppresses the peaks somewhat and gives the so- 
called Debye-Waller factor. It also, due to the time-dependence, contains 
information about processes where phonons are absorbed and emitted during 
the scattering. Further simplification of (7.35) will depend on the specific 
context. 
Consider now another medium, say, a liquid or gas. Notice that we can 
write ’ 

V7 (x)Vi(x) = (f|PP(2)|t) (7.36) 
which follows from writing out the operator P in terms of a mode expansion 
with W’s as the one-particle wave functions. PP is the density operator for 
the nuclei which we will denote by Jo(zx). (It is after all the time-component 
of a number density current.) We then find 


2 oe oe 4 74 —iq(z—y) /; F 
IAI" = Fa | tadye (i|Jo(w)|F)(F| Jo(@)|4) (7.37) 


Summation over final states gives the quantity (i|Jo(y)Jo(x)|t). We can sim- 
plify this as follows. 


(i|Jo(y)Jo(a)|i) = (ile*P¥ Jo(O)e*P?—™ Jo(O)e~*** |i) 

(Jo Ole Tele) 
(i|Jo(0)e*P?— Jo(O)e*P@™ |’) 

(i|Jo(0)Jo(x — y)|#) (7.38) 


I 
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where we have used the energy-momentum operator P = (H, P) to trans- 
late the arguments of the Jo’s to the indicated points. (This P is not to 
be confused with the field operator for the nucleus.) p; is the eigenvalue of 
energy-momentum for the initial state |i). Using this result and changing 
variables from x, y to x — y, y we can do the integration over y in (7.37). The 
cross section can thus be written as 


4 


A2m d3k! 
ig ae (am) (7.39) 
Sa i dx e~% (i|Jo(0)Jo(z)|A) (7.40) 


S(q) is the two-point function for the density operator. We see that it can be 
measured by neutron scattering. 

In some situations, such as slow neutron scattering which is essentially 
elastic, it is a good approximation to take Jp to be independent of time. In 
this case 


S(q) = 216(ko — ky) S(q) 
SoS ( daz © (i| Jo(0) Jo(x) i) (7.41) 


S(q) is called the structure factor. 


7.4 Compton scattering 


We now consider Compton scattering with spin-$ charged particles, for ex- 
ample, the scattering of photons by electrons or positrons. It is described 
by the QED S-matrix as given in (6.36). The relevant term of the S-matrix 
functional is 


6 6 bey 
F= exp If says z, | ete [ Andy % 
NC TTC) 
= (ie)? [ H(e)7"S(e, yyy" WW) Au(@)Ar(y) (7.42) 
Upon replacing the w,7 and the A’s by the one-particle wave functions ac- 


cording to (a generalization of the) formula (5.24) we get the amplitude for 
the process as 


4/ m? 4 
ey (4) et wees 
A Wa0V 20" (27)°5 (p+k—p'—k') M 


: ' i 
M = ie? |dgyy - ee) —__—___ 4+. el) 
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where we have denoted the incoming and outgoing electron momenta by p, p’, 
respectively, and the corresponding photon momenta by k, k’. Subscripts a, 3 
on the spinors refer to the spins of the incoming and outgoing electrons and 
e) e’) refer to the corresponding photon polarizations. The kinematics of 
the process remain as in the discussion of photon scattering with a scalar 
charged particle. If we consider the rest frame of the initial electron, then the 
flux is 1/V and we get 
; 2 
- (=) IM\2aa (7.44) 


1672 


where, as before, 
; mw 


cages m + w(1 — cos @) 


There is no significant difference with the scalar case until this point, except 
of course that M is different. 

The cross section (7.44) applies to fixed incoming polarizations for the 
photon and electron and for scattering into specified final polarizations. For 
unpolarized incoming electrons and where final polarizations are not mea- 
sured, the quantity of interest is the above expression averaged over initial 
spins of the electron and summed over final spins. The result can be written 
as 


(7.45) 


1\2 1 
do = 0? (=) : Ss IN 24a (7.46) 
N= tippy Of” Uape> eY) + tgp OF “tape e) 
1 
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The square of NV will involve four types of terms. In general, when we square 
a term like ugOu, we get 


= 0 = ey 
ae Usp’ Ouapt,yO'y Coy = ae Upp! OUaptapOugp' 
a, aB 


= (m+7-p) ,(m+y-P') 
-rjor toe (7.47) 


where we have used the fact that y°O'y° = O for the combinations of 7- 
matrices we have, which follows from the fact that y’ are antihermitian and 
anticommute with y° which is also hermitian. We thus get 


Am? sa ie jes*? O,-e€Y (m+ 7-pye™ -O,-e?? (m+ -p')] 
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ee (7.48) 


For the traces of the y-matrices, we have the formulae 


Tele" | = Agee 
Tela at | aay — eee 
Tr [ts yH2 ae yan] = 71 2 Ty [3 Ha see @ al 
oe. nt ee Try 4yh* 506 nyt} 
+ ttle Tel yHe Hays... Han]... (7.49) 


These rules follow from the basic Clifford algebra relations y4y” + y’y? = 
2n"”. The last result is a recursion rule which can be used for higher numbers 
of y-matrices to reduce such traces to lower ones. Using these rules, the 
relevant traces in (7.48) can be evaluated, although this is rather tedious. 
The result, in the rest frame of the incoming electron, with p = (m,0,0,0), 
becomes 


w! wW , 
ae hy (A). @("))2 _ 
Ne — pag =(- me on + 4(e? . el 7) | (7.50) 
The differential scattering cross section (7.46) can now be written as 
doa? fu'\* [ul wv , 
a eee ale ag ic (A). eA’))2 _ 
In G2 (=) F a os +4(e) .e*?) | (Fal) 


This is the Klein-Nishina formula. In the nonrelativistic limit or when the 
photon energy is small, we have w’ = w, and this reduces to the Thomson 
cross section 


d 
a ae EO e’))2 (7.52) 


The expression (7.51) applies to the case where the electrons are unpo- 
larized, but the photons are polarized. We can get the cross section for all 
final polarizations of the photon and for unpolarized incoming photons by 
summing over final polarizations and averaging over the initial polarizations. 
The use of (7.13) then gives 


= =e: e))? = 5 (1 + cos? 6) 


Lyfe 2a) afte 9 (7.53) 


w! 


Using this in (7.51), we find for the unpolarized cross section 
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ge eee 
410 one (=) E 1 Whe sin q (7.54) 
Integration over all angles gives the total cross section. In the nonrela- 
tivistic limit, it is the Thomson total cross section. 


a(w) OT, = Bont (7.55) 
More generally one has 
o(w) =orn F(w/m) (7.56) 
where 
F(w/m)- 1 as) eo 
m 
3m 1 log(w/m) W 
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> ee og(2u/m) +5 +0 ( aa as — — 00 


(7.57) 


The result that the cross section agrees with the Thomson cross section 
when the photon energy is low is the same as what we found for the scalar 
particle. Our calculation is only at the lowest nontrivial order in the coupling 
constant e and only for spins-0 and -5. This reduction to Thomson cross 
section is, however, universal. Based on the gauge invariance of the electro- 
magnetic interactions, one can prove the following low-energy theorem. The 
exact cross section for Compton scattering reduces to the Thomson cross 
section as the photon energy w goes to zero. Of course, we have (Q?/4z), 
instead of a, for a particle of charge Q in the formula for the cross section. 
This theorem is very general and holds even after including renormalization 
effects and contributions from other hadronic intermediate states and so on. 
It can be used to define what is meant by the charge of the particle, viz., we 
can say that the charge Q of a particle is defined by the formula 


(Gr e  ntoto) (7.58) 


where o(w) is the Compton scattering cross section. (For a related low-energy 
theorem, see Chapter 11.) 


7.5 Decay of the 7° meson 


The 7°-meson (or the neutral pion) is a neutral particle made of a quark 
and an antiquark. It decays into two photons. The interaction Lagrangian 
responsible for this decay can be shown to be 
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ieee (7.59) 
uf 
where ¢ is the pion field. a is the fine structure constant, and f = fr ~ 
93MeV is the pion decay constant, which can be measured in the leptonic 
decays of the charged pions. The interaction (7. y will be derived in the 
discussion of anomalies later. nq 
We consider incoming pion momentum p and outgoing photons of mo- 
menta k,,k2 and polarizations e) e(). The amplitude is obtained by re- 
placing the fields by the one-particle wave functions in e*Sint, We find 


_ (20/5 (p — ka = ba) 
S V/ 2po V 2w 1 V 2weV 


M = ~cje( baton — Raitrese\™ (7.60) 


The decay rate is given by |.A|?/7, where we use the usual trick of replacing 
the square of the delta function by one delta function and a factor of Vr. As 
regards final states, there is an additional factor of 2. This can be seen as 
follows. The final state has two photons, which are identical bosonic particles. 
The state ((k1,e), (k2,e’)) is not distinguishable from ((k2,e’), (ki,e)). In 
summing over final states, if we do an unrestricted sum over momenta and 
polarizations, there would be double counting. We can remove this by dividing 
by 2. Thus the total decay rate is given by 


final 
d?k, d®kp (27)45) (p — ky — ke) 2 
x | (27)° (27)? 16pow1w2 pala _ 


A,r! 


The k-integration is trivial because of the delta function, giving kz = p—k}. 
In the following, we shall choose the rest frame of the pion, so that p = 0, pp = 


m, and hence k = ki = —ko and w =w = we. We also have 
Be |M|? = WE gr ky e; er] 
Dime = OI 
8a7w4 
= “72 fz (7.62) 


The decay rate is thus given by 


r= 8a7%w* 2nd(m_z —2w) dk 
‘_ 16m,,w2 (27)3 


sr (7.63) 


7.6 Cerenkov radiation 97 


The numerical value given by (7.63) is approximately 7.63 eV to be compared 
to the experimental value of (7.37 + 1.5) eV, which is not bad as we have 
made some assumptions such as ignoring the composite nature of the pion. 


7.6 Cerenkov radiation 


The basic process involved here is single photon emission by a charged par- 
ticle. Such a process is kinematically forbidden in vacuum, but in a medium 
it can happen if the velocity of the particle exceeds the velocity of light in 
the medium. While the dielectric constant itself arises from interactions of 
the photon with the charged particles in the medium, for the purpose of dis- 
cussing Cerenkov radiation, we can have an effective description where the 
medium is assigned a dielectric constant e(w). In this case, we have for free 
photons, k? = e(w)w?. Let p,p’ denote the initial and final momenta of the 
particle and let k denote the momentum of the emitted photon. Conservation 
of energy-momentum gives 


p=p+k 
E=E'+w (7.64) 


Squaring the second of these and using the first, we find 


_ Ew , (€—1)u? 
cos 6 = a pe 
1 (€-—1)w? 


———— 7.65 
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where @ is defined by p-k = pk cos@ and v = p/E is the velocity of the particle. 
As € — 1, the right-hand side becomes larger than 1 with no solution for 0 
and the photon emission is kinematically forbidden. In the medium, light 
is emitted on a cone whose angle is given by (7.65). For « = 1, we can 
approximate the above relation by 


cos 6 = (7.66) 


i 
uve 
Of course, we need v,/e > 1 for this to make sense. (Actually, the second term 


in (7.65) is a quantum effect and hence (7.66) is all that appears classically.) 
We now turn to the dynamics. The photon Lagrangian, with Ao = 0, has 


the form re 
L= ; c (#) = a] (7.67) 


The canonical momentum is I; = €A;. The expansion for A; is now 


98 7 Examples of Scattering Processes 
= Lawes” up (x) + ah ee uz (x) 


l ik 
= kr . 7.68 
ur(z) V2 (é me ( ) 


where a,at have the usual commutation rules. The factor of \/e is required 
in the denominator of u,(x) so that I7,A have the standard commutation 
rules. The Hamiltonian can be written as 


= i calane : (7.69) 


We take the charged particle to be a scalar, so that the interaction term 
for one photon emission is 


Lins = —Stegeonea eee | (7.70) 


The amplitude for single photon emission is thus given by 
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where we have used the fact that p’-e = (p+k)-e = p-e since k-e = 0. The 
photon emission rate is given by 


Al? e?_ (p-e 
AY = 2S en's p-y'-& (7.72) 


The rate of energy loss is obtained by multiplying the above expression by 
w since the energy loss per photon emission is w. Putting in the final state 
summations, we find 


dE e* (p-e dp! 
dt ~ ie oj aan On) a a (2n)3 


= of (1 — cos* 0)5(E — \/(p — k)? + m? — w)k*dkd(cos 6) 
(7.73) 


where we have carried out the integration over p’ and also used the fact that 


one 2_ (p-k)? 
dP ep-e=p s 


= p*(1 — cos” @) (7.74) 
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We carry out the angular integration; this sets the value of 6 to that given 
by (7.65). For the remaining integral over k, we then find 


a — cos? 6) (7.75) 
Now, approximately, 


1— cos? 6 ~ (1- =x) 
ve 
kdk = ewdw (7.76) 


where we neglect 0¢/Ow. Also, the final result is usually expressed as energy 
loss per unit length of the path; this can be done using dx = vdt. Combining 


these results oe 
1 


This is the Frank-Tamm result for energy loss by Cerenkov radiation. 


7.7 Decay of the p-meson 


The decay of the neutral o-meson into charged leptons can be modeled by a 
Lagrangian which incorporates mixing between the p and the photon. The 
Lagrangian for the p,,,e*,e~ and the photon is given by 


= Loe Lint 
Lo = ip pe lig - A)? — : pvyO" p’ + 129? 
ate 2 pone D 
+ o(iy-8—m)p 
Lint = Mp -A+epy: Ay (7.78) 


with 0: p = 0. The p — A term is consistent with the mixing of the p and the 
photon in the so-called vector dominance approach to p-meson interactions. 
The constant g has the numerical value, g ~ 5. The p-meson has many 
channels for its decay. The Lagrangian given above can describe the decay of 
a p-meson into an ete -pair. The partial decay rate for this mode can thus 
be calculated using the Lagrangian (7.78). 

The amplitude for the decay of the p of momentum p and polarization 
e) into an ete~-pair of momenta k, k’, is given, to the lowest order in the 
coupling e by 


4 5(4) ‘a 2 7} iB 1 
An CHEE AM SD) (yim ep) Gattow) §=— (7.79) 


7 
/2upVERVERV 9 p? 


100 7 Examples of Scattering Processes 


The factor of 1/p? arises from the photon propagator due to the oo con- 
traction of the A’s in the expansion of e**"*. For this calculation, p* is equal 
to M2. We will calculate the decay rate for an unpolarized p-meson; there 
are three polarization states and the average over the polarizations can be 
done using 


So Me™ = —Nwy + ar ee (7.80) 


The decay rate for an unpolarized p-meson with a small range of final mo- 
menta and all possible final spins is given by 


/ 3 3 Jf 
0) 4p pp 208 Vk+k —p)PkPk 
Sm fs Yer ee (781) 


Using the polarization summation formula (7.80), we find 
ol 
So leer eM uj, |? = 6 a} (7.82) 
A 2,6 


For a p-meson at rest, p = (M,0,0,0) and k’ = —k, Ex = Ex. In this case 
the decay rate is given by 


sien 4k?\ 5(2E, —M 
re 5 [ oe (0+ 5) 
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= iLL WY Be 83 
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where @ = e”/47. Since the electron mass is small compared to the mass of 
the p, we can simplify this to 

_ 4na? M 

3g? 


(7.84) 
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8 Functional Integral Representations 


8.1 Functional integration for bosonic fields 


We have discussed the notion of functional differentiation earlier. Here we 
shall discuss the notion of functional integration. A certain function space F 
was defined in Chapter 2. For integration, we need a volume measure on the 
function space F. We can do this by specifying a distance function or metric 
on ¥. For a real scalar field y(x), the distance between the configurations 
y(x) and v(x) + dy(x) may be taken as 


ds? = |6yI? = [ate (69) (8.1) 


If we expand y(zr) in terms of a real orthonormal basis, i.e., g(x) = 
Yon Cntin(z), the coefficients c, are real and we have 


ds? = |\5g|? = )(6en)? (8.2) 


We see that the distance defined by (8.1) is Euclidean. This would be ap- 
propriate for real-valued scalar functions. The metric will in general depend 
on the nature of y. For example, consider maps from the spacetime region 
»} to the two-sphere S?. Using (6,«) as the coordinates of S? (where a is 
the azimuthal angle), the maps we are considering are 0(x),a(z). On the 
corresponding function space, the metric should be of the form 


ee | d'z [(60)? +sin?6 (6a)?] (8.3) 
Das 


Since there is a choice of c, for each function, and each function corre- 
sponds to a point in F, cy, can be taken as local coordinates on F. Once 
we have a set of local coordinates c, and a metric tensor gnm, the volume 
element is given by 


[dp] = dV = V/detg dcjdc2-:- (8.4) 


In general, since we have an infinite number of c,’s, we must specify the 
volume measure by a limiting procedure, i.e., define 
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dV) = s/det g(™) deidez +++ den (8.5) 


where we cousider N modes with a corresponding (N x N)-matrix gnm. Then 
dV = lim dv) (8.6) 
N—0oo 


The finite mode version of the metric is referred to aS a regularized metric. 
Generally, all metrics on function spaces have to be defined with proper 
regularization. 

The integral of interest to us is a Gaussian integral, which we now evalu- 
ate. Consider 


[= if [dp] exp I-3 [ d‘ad*y y(x)M (a. wolw) (8.7) 


Introducing a mode expansion in an orthonormal basis given by y(z) = 
>, CnUn(x) as before, the exponent in the above equation can be written as 


[ atedty oe) M(@, wou) = Yo enMnmen (8.8) 


where 
Mom = iL dted*y tin(x)M (2, y)tm(y) (8.9) 
ps 


We are interested in situations where M,,,, is diagonalizable with eigenval- 
ues which have a positive real part. We can then choose an orthonormal 
basis which diagonalizes M; i.e., we consider eigenfunctions f,(x) of M(z, y) 
defined by 


[av M(@u)falu) = Anal) — @:10) 


Expanding (x) as y(r) = 30, anfn(x), we have ||dy||? = >>, (dan)?, dV = 
[[ dan, so that 


N 1 N 
I= aim, / [[ldon exp =} oad 
_ ay [an 
= Le 
oo 1 n 
= 
M 2 
= | der 
ay 0 


The determinant should be defined by the limiting procedure of first eval- 
uating det M“™) and then taking N — oo. This method of truncating to a 
finite number of modes N and then taking the limit of N — oo to define 
functional determinants, and functional integrals in general, is referred to as 
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a regularization procedure. Since the result involves the determinant which is 
independent of the basis, it is clear that our choice of the diagonalizing basis 
{fn(x)} is not a restriction. 

We now consider a related integral 


11] = | tdglexp 3 ih dhad'y (x)M (x, y)o(y) + a dts Haye(o) 


(8.12) 
This can be evaluated by completing the square in the exponent. We find 


IJ] = [late tS omer Jp 
a | aiid call 


= face ores f00- 


1 
M\172 
= ace (=) exp 5. f. d‘xd*y J(x)M—*(a, y)J(y)| (8.13) 
where we have used the translational invariance of the measure to shift the 
variable from y to (p — M—1J). 
We have considered real functions so far. For a complex scalar function 


v(x) we can write v(x) = (y1(x) + ive(x))/V2, where v1, v2 are real. Then 
the following result emerges from what we have done so far. 


Had \ = [ldeapien Somers To+@J 


- act (#)| ef Imo (8.14) 


Since there are two scalar fields, we get two copies of [det(M//27)]" 2 


8.2 Green’s functions as functional integrals 
The integral in (8.13) can be used to obtain a functional integral representa- 


tion for the generating functional Z[J] of the N-point functions for a scalar 
field. The free part of the action for a scalar field y can be written as 


S = f dtc} [(p8p) — my?) 
- [ ds $o(c)(0+m)9(@) (8.15) 


We need to consider the integration over all fields of exp(iSo). So is real and 
hence such an integrand is oscillatory. We shall therefore consider the integral 
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of exp(iSo — 4e f y*); €, which is a small positive real number, is introduced 
for convergence at large values of the field. In the end, it can be set to zero. 
We have 


iS —tefe?=-5 f veM@vel) 


M(sa, y) = (Ce + m? — ic) (a ~y) (8.16) 
If G(z, y) is the Feynman propagator, we have 
i} d4z M(a,2)G(z,y) = 6 (e —y) (8.17) 


which shows that G(z,y) is the inverse to M(z,y). Direct application of 
(8.13) then gives 


[tae exp [iS — ge fy? + f Jy] =C exp E [atady HayG(a,)Ju)| 


(8.18) 
where the C stands for the determinant in (8.13). It is independent of J. The 
right-hand side, apart from this constant, is just the generating functional 
Zo(J| for a free scalar field theory. In other words, we have shown that 


PAR / (diol exp [Sp — de fy? + f Jol (8.19) 


It is interesting in this context to note that the e-term was introduced in 
the functional integral for convergence at large values of the field and at the 
same time it helps to pick out the Feynman contour for the Green’s function, 
leading to the propagator in the formula (8.18). The choice of the Feynman 
contour is crucial for the integrand of the propagator to be continued to 
Euclidean space. Likewise, the functional integral may be defined in terms 
of the Euclidean action and then continued to Minkowski space and the 
convergence factor ie then appears naturally. The reasons for ie in the two 
contexts are clearly related. 

Consider now the interacting field theory. For a y*-interaction, we have 
shown in Chapter 5 that the generating functional Z[J] is given by 


Zee -a [os (aa) | ZolJ| (8.20) 
tgmes(s.10) tor Zl] worn 
~id f ate Gata) | [lacie [iso — defo? + f Jo] 
=" [[aclex lis —ir fo - bef? + f Jo] 


=N" f (ao) exp [iS — fy? + f Jy] ~ (8.219 


Z| J] = N’ exp 
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where 


y= pas [$(Opd¢) — dm?y? — Ay | (8.22) 


is the full classical action for the interacting theory. The constant NV’ is 
determined by the requirement of Z[0] = 1. 
We can generalize (8.21) to any polynomial type of interaction as follows. 


Z(J] =N fide exp [iS — Se fy? + f Jo] (8.23) 


where S is the classical action for the interacting theory and the normalization 
factor is determined as 


N= | [dplexp [iS — 4e f 97] (6.24) 


This result can also be directly derived as follows. We have seen in Chapter 
6 that the operator equations of motion, for the y4-theory, are 


(CO, + m?) (x) + 443 (x) = 0 (8.25) 
The canonical equal-time commutation rules are 


[o(a°, x), d(x, y)] = 0 
[n(x°, x), d(x, y)] = -i 6 (a — y) (8.26) 
[a(x°, x), (2°, y)| =( 


where 7(x°, x) = Oo¢(zx°, a). Based on these, we derived, in Chapter 5, the 
equation obeyed by Z[J] as 


52[J] ie é 

- +4 | —— }] Z[J)] = -iJ (x) Z| J 8.27 
Cem? ig SE san(s F<) zi|=-iste\zis) (621) 
Instead of solving this, as we have done, in terms of the free theory after 
separating off the interaction term and obtaining (8.20), we will show how it 
arises in the context of functional integrals. Notice that we have the following 
identity: 


fiael 5 =o [iS — Se fy? +1] =.() (8.28) 


The integrand in (8.28) is a total derivative and so we can do the integral and 
express it in terms of the values of the integrand at the boundary of y-space, 
ie., at large values of |y|. The integrand vanishes for large |y|, because of 
the exp(—4e f y?)-term and this leads to (8.28). Writing out the derivative 
in this equation and noticing that 
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é , 
F (=) fie exp [iS — de fy? + f Je] = 


[laciew) exp [iS — de fy? + f Jo] (8.29) 


we get 


nei) js 
io +m? — ‘aay + 4. (5) + 7) x 


[tae exp [iS — de fy? + f Jy] =0 
(8.30) 


Comparing this with (8.27) we see that the functional integral of (8.23) gives 
the solution for Z[J]. 

Equations (8.23, 8.24) are rather remarkable. They allow us to bypass a 
lot of the operator formalism of the quantum theory and go directly from the 
classical action for the theory to the generating functional for the N-point 
functions, and hence to the S-matrix. And all manipulations are “classical”, 
i.e., just integrals. As we shall see in the next chapter, things are not so sim- 
ple; the need for renormalization of various coupling parameters will require 
the use of a slightly different version (or rewriting) of the classical action 
in practical computations. But this modification is still rather minor and 
(8.23, 8.24) give a way of proceeding directly from the classical action to the 
quantum theoretic results for the N-point functions and the S-matrix. 


8.3 Functional integration for fermionic variables 


We must now discuss a functional integral representation which is suitable 
for fermions. The functional integral representation, not surprisingly, is pro- 
vided by integrations over anticommuting c-number functions or Grassmann 
variables. 

We start with a definition of integration over Grassmann variables. Con- 
sider first the case of one variable 7. Since any function of 7 is of the form 
f(z)o+f(xz)1n, we have to define only { dy and [ dn 7. The definition of inte- 
gration will be formal, but will be consistent with the expected behaviour of 
definite integrals over the entire range of 7. Consider f dy 7. Since our formal 
definition of integration should mimic integration over all 7, this should be 
invariant under translations of 7. Thus we require that 


[era+e)= fann (8.31) 


for any Grassmann number a. This tells us that f dy = 0. f dn 7 need not 
be zero. We define it to be 1. Thus the rules of Grassmann integration are 
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fa i fa aol (8.32) 


More generally, if we have N Grassmann variables 7;, we find, by repeated 
application of the above result that all integrals are zero except the one whose 
integrand involves the product of all n’s. The only nonzero integral is 


/ dnndnn-1...dm mM2...nn = 1 (8.33) 


For a more general ordering of the 7’s, from the antisymmetry property of 
the 7’s under commutation, we get 


/ [dn] Nis Nia---Nin = Cinta...tv (8.34) 


where [dy] = dnndnn-1...dm. 

The integral of interest to us for field theory calculations will be a Gaus- 
sian integral. Consider N variables 7; and N variables 7;, all mutually anti- 
commuting. (7; are independent Grassmann variables, they are not the “con- 
jugate” of 7; despite the notation.) We can now evaluate the integral 


I= i, [dnd7j] exp (- amin (8.35) 


The only term that contributes, by (8.33), should have N 7’s and N 7’s. 
Expanding the exponential and using (8.34) 

4 1 

[= (1) BNO) inte tw infty Minn Mian Miwiy 


= (—1)?4(+) (det M) (8.36) | 


The generalization to functions is given by 


I= i: [anda] exp (- / n(x) M(z, u)ntu)) 
= (det M) : (8.37) 


We have ignored the overall sign since it will not be relevant for us. As in the 
bosonic case, the determinant has to be evaluated in a regularized way, i.e., 
it should be evaluated for a finite number of modes and then the limit of an 
infinite number of modes should be taken in an appropriate way. 

Finally, consider the Grassmann-valued functions (x) and 7)(x) and (z), 
n(x) and the integral 


I{n,7\ = : [dar osn( - i oe (x, y)b(y) + [ (n(x)b(x) + Heme) 
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= lavas) exp (— {= am~) mw - M~n)) exo ( | nM) 
= (det M) exp ( / nM “n) (8.38) 


These results can now be applied to fermionic fields. Many-fermion prop- 
agators have been discussed by introducing a generating functional 


Z(n.q] = (Ol exp ( [w+ in) 0) (8.39) 


The sources 7,7 are spinors and are Grassmann-valued. They anticommute 
with 7, ~ as well. The expression for Z({n, 7] was given in Chapter 4 as 


Aine | [ dted'y aloste, mo) (8.40) 


where we have chosen the normalization Z(0,0] = 1. 

The integral (8.38) can now be applied to obtain a functional integral 
representation for Z[n, 7]. Taking M(z, y) = (—i)(iy - 0, — m)6“ (x — y) the 
integral in (8.38) gives 


I{n, A] = | (ddd) exp] — b(x)M (x, ee b 
inal = [[dvdd) e r( / Fema vv) + if (al te) + Herta 
= (det M)ZIn, a] (8.41) 
We can write 
zs ft B()M(e, yoy) = 5 de G(iy-O— mv 
zy 


= iS[b, v] (8.42) 
where S{w, 1] is the action for the Dirac theory. The integral formula for 
Z|n, 7| becomes 

Zinn) = [laveed| exp [iste] + f ate (awe) + Henle) 


(8.43) 

The generalization of this result to interacting fermion theories is entirely 

straightforward. We find in all cases, involving both fermionic and bosonic 

fields, that the generating functional for the N-point functions may be written 
as 


Z(J,n, 7] = N i [dydypdy] exp isto 


+ f dx (J(o)o(2) + m(e)v(@) + Hla)n(a)) 


(8.44) 
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where, in the integral, the fermionic variables are Grassmann valued and the 
bosonic fields are ordinary real- or complex-valued functions. The normal- 
ization N is to be fixed by the requirement of Z[0, 0,0] = 1. Thus given the 
classical action for the fields, we can bypass most of the operator formalism 
and proceed to the generating functional for the N-point functions by making 
use of the above formula. In some sense, it may be taken as the definition of 
the quantum theory of the fields. 


8.4 The S-matrix functional 
Since we have a functional integral representation for Z[J], it is clear that 


we have a similar representation for the S-matrix as well. In Chapter 5, for 
a scalar field theory, we obtained the formula 


Fiel=exn|-3 foto+m yo] zlC+mye] (8.45) 


We now use the functional integral (8.23) for Z[J]. The action can be split 
as 


S(x) = f ate [$(0x)? ~ $m?x4] + Sine(x) 
= So + Sint (8.46) 
We complete the squares in S(x) + fix(Q+ m?)y to obtain 
Zli(+-m?)] =N | faxlexp[iSolx— 9) - def? 
+4Sint(X) + ; / y(+ my] 
=N / [dx] exp [iSo(x) — he fx? + iSine(x + 9) 
+5 [ eO+m)] 
=e [5 foamy] ( explisinx+ Do 
(8.47) 


We made a shift of the variable of integration y as x — x + y in the second 
step. The angular brackets denote functional average over x’s with just the 
free part of the action; i.e., 


(O)p=N / [dx] exp [éSo(x) — ge fx?] O (8.48) 
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We have also omitted some terms of order € which can be set to zero without 
affecting convergence of the integral. Using equation (8.47) in (8.45), we get 


Fle] = ( exp [iSine(x + ¥)] )o (8.49) 


This equation relates F[y] to the free average of the interaction term in the 
action S. We can directly expand the exponential to generate the perturba- 
tion series for the S-matrix. Being a Gaussian average, the average of a prod- 
uct of x’s will factorize into the products of two-point functions (x(z)x(y))o 
with suitable symmetrizations. In the equation for the averages and the S- 
matrix, we have included the full normalization factor VV for the interacting 
theory. One can use the normalization factor for the free theory to define 
the functional averages provided the S-matrix is properly normalized by the 
condition F[0] = 1 at the end. Notice also that equation (8.49) is a way of 
rewriting the operator formula (5.85). 

The above derivation, although given for the scalar field theory, can be 
easily extended to more general theories and one can write the S-matrix 
functional as 


Fly, au, ¥, ¥] = (exp [tSine(X +9, Ap tan,%+U,%+%)]). (8.50) 


x, Ay, YW and W are the fields which are integrated over with the free action 
in the measure to define the averages in this equation. 


8.5 Euclidean integral, quantum electrodynamics, etc. 


The propagators and the N-point functions can be obtained, as we have 
discussed in Chapter 4, as the Minkowski space continuation via +* — ix® of 
the corresponding results in Euclidean space. A Euclidean space version of 
the functional integrals is very useful, especially for higher-order calculations. 
We define the Euclidean free field actions for a scalar field and a fermion field 


Sr0 = [ats [50007 38 amp? + oy: d+ m)| (8.51) 


where all scalar products are taken with the Euclidean metric du, and the 
y-matrices obey YY + Ww = 26uv- The Euclidean version of (8.18, 8.41) is 


[laeavai| exp |-Seo(o,,v) + f de (Jp +964 in| 


= Cexp| fate (5.1(2)Ga(e,v) J) +02) Se(0 u)ntw)) 
(8.52) 


where 
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d*p —— 
Gr(z,y) = (Qn)4 ae p(e—y) 
d* 1 ; 
Se(ay)= | = eiP(a—v) (8.53) 


(27)4 iy-p+m 


We may write, more generally with interactions, for the generating functional 
Z in Minkowski space 


Z|J,A,n) = ZelJ,An| 


ZelJ,7,n] = N ‘/ [dpdydy] exp sete. pw) + i" d*x (Jp + iv + bn) 
(8.54) 


Now we turn to quantum electrodynamics (QED). As we have shown 
before, the photon propagator, for calculations in QED, may be taken to 
have the covariant form given in equation (6.23), namely, 


4 . 
D (x y) =7 aki e—tk(a—y) (8 55) 
a. md (Qn)4 k? + te 


The corresponding Euclidean propagator is given by 


Dey oa [& ny ae ehev) (8.56) 
The Euclidean functional integral for QED is thus given by 
Bel J, Hon|= Nf {dAavad exp [Se ei jes (Ags” + Habe a) (8.57) 
where 
Se(A,i0) = f dx [5 OnAvdpAs) + 9(1-(O-ieA) +m) u] 


= f ez F FF += 5(0- A) + (7-(0 ied) +m)¥) 
(8.58) 


N is, as usual, fixed by the requirement Z(0,0,0,] = 1. The integration over 
A’s in (8.57) is done with the standard Euclidean measure on the space of 
A’s. 

The general idea of the functional integral being given by the integration 
over the classical action, as discussed after equation (8.44), would suggest 
the use of the Maxwell action [ 4F?, or its Euclidean version, in the for- 
mula (8.57). The fact that one has to use the action as given in (8.58) has to 
do with gauge transformations. Since A, and its gauge transform A, + 0,0 
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are physically equivalent, there is redundancy in the choice of field variables. 
For integration over the Maxwell action, one must then use a measure where 
this redundancy has been removed. The use of such a measure, which we 
shall discuss in more detail in the chapter on gauge theories, will lead to 
the above formulae for QED. In summation then, for all theories, the Eu- 
clidean functional integral for the generating functional can be written as the 
integral over exp[—Sg] with integration over the space of all physical, i.e., 
non-redundant, field configurations. (For QED there is a further averaging 
over gauges needed to get to the formula (8.57), see Chapter 10.) 


8.6 Nonlinear sigma models 


Nonlinear sigma models are an important class of field theory models. The 
functional integral for these theories has a nontrivial measure for integration 
over the fields. It is interesting to derive this following our general approach 
of using the operator equations of motion and the equal-time commutation 
rules. 

Consider a general Riemannian space M, which may have nonzero cur- 
vature, with coordinates y“4 and metric 


ds? = Gapdyp“dy® (8.59) 


The metric tensor Gg is in general a function of the coordinates oA, The 
action for a point particle moving on such a space can be taken as 

1 dip4 dy® 

oi " dr ~-Gas———— 6 

2°48 ar dr 680) 
where 7 parametrizes the trajectory. The classical equations of motion are 
the geodesic equations, the classical trajectories are geodesics. As a general- 
ization of this notion one can consider fields y4(x) which give a map from 
the spacetime to the Riemannian space M. This space M into which we are 


mapping from spacetime is often called the target space. The action is then 
taken as ApoE 
= —qdtr gt” 1 op" a 

s= | y= rg 5 GAB aoa Fe 

The action is determined by the metric of the target space. For generality, 

we have written g*” for the spacetime (inverse) metric. The formulation of 

the theory along the lines given here is applicable for the case of a general 

spacetime which may have nonzero curvature as well. One can also consider 

Kuclidean signature for the spacetime. For flat Minkowski spacetime, which 
we consider for the rest of this section, g4” = n¥”. 

Field theories defined by the action (8.61) are called nonlinear sigma mod- 

els or sometimes, they are called chiral models. (The reason for the name 


(8.61) 
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“sigma models” is historical; it arose in particle physics literature, in the 

context of chiral symmetry breaking, from a linear field theory model where 

there was a field which was designated by o; the nonlinear model resulted 

from taking the mass of o to be large compared to the momenta of interest.) 
The classical equations of motion for this action are 


a [, O98]  18Gzc dp? dC _ 


The solutions to this equation form a special class of maps from the spacetime 
to the target space M which are a generalization of the notion of the geodesic. 
Such maps are called harmonic maps in the mathematical literature. 

The importance of this theory in physics has to do with Goldstone’s the- 
orem. Consider a field theory which has a continuous global symmetry cor- 
responding to a Lie group G. (A symmetry is global if the parameters of the 
symmetry transformation are constant in spacetime.) If the vacuum state 
does not have this symmetry G, but is symmetric under a subgroup H, we 
say that the symmetry G is spontaneously broken down to H. A typical exam- 
ple is the Heisenberg ferromagnet for which the action has three-dimensional 
rotational symmetry, but the ground state, which has spontaneous magne- 
tization along some direction, breaks this rotational symmetry. Goldstone’s 
theorem tells us that there will be massless particles corresponding to each 
broken symmetry. There are (dimG — dimH) such massless particles called 
Goldstone particles. Further, since the particles are massless, at low ener- 
gies, where one does not have enough energy to excite massive particles, 
we essentially get the dynamics of the Goldstone particles interacting among 
themselves. The action for the Goldstone bosons is given by a nonlinear sigma 
model of the form (8.61) where the target space is the group coset space G/H. 
Thus nonlinear sigma models are important in all physical contexts where 
spontaneous breaking of continuous symmetries occurs. (The phenomenon of 
spontaneous symmetry breaking is discussed in detail in Chapter 12.) 

In this section we will just consider the functional integral for the sigma 
model following our general derivation of the functional equation for Z[J] 
which is defined in the usual manner as 


Zid = (ol T exp( f Jap) 0) 
=( T exp @ Ja¢*) ) (8.63) 


The canonical commutation rules for the fields are 


[ea 2), o @ yi —0 
[y4(2°, 2), 13(2°,y) ] = 1 6g 6 (x —y) 
[ 14 (2°, a), 13(2°,y)] =0 (8.64) 
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where II, = Gapd"yp®. We also define 


F(a°, y°) = T exp (/. is [ &xrereo) (8.65) 


By direct differentiation we check that this obeys 


soa Flat.) = | f Padate.2)04@.2)] FW) 
oo Fa®.a) = — Rev) | f Padaly’,2)94.2)] (8.66) 
Using this quantity we can write 
8,(TIT4 (x)ed 7) = 8,(F (00, 2°)IT4 (2) F (2°, -00)) 
= (F(c0,2°) [14 (2), f a®zJa(o°, 2)94(2°, 2)] F(2®, —00) 


+O, IIe) 7%) 


poGse dy? oa ed 1) 


(8.67) 
where in the first step we have used equations (8.66) and in the second step 

we have used the commutation rules (8.64) and the equation of motion (8.62) 

interpreted as an operator Heisenberg equation of motion. We now write 


pe 
(TGapO"'yp Bef Jey — leant Fe—| Z(J| (8.68) 


where G4 = 6/6J,. In taking the factor 0“y? outside the vacuum expec- 
tation value and replacing y? by the functional derivative with respect to 
Jp, we may worry that there is an equal time commutator term due to the 
time-ordering. There is indeed such a term, but it involves the commutator 
of y with y and vanishes by (8.64). Equation (8.68) can be used to simplify 
the left-hand side of (8.67). We want to make a similar simplification of the 
right-hand side. We start with the expression 


Par ZEEE amp (a)e0 (2) ef ) = er Ob (2) o@(2) ef) 


(8.69) 
In simplifying this, we encounter commutators when we bring the z°-derivative 
inside the time-ordering symbol. By writing out the time-ordered product as 


T Of. 4(x)p° (z) ed 7 


= | F(0o, 2°)O4, , (2) F'2°, z°)y° (z)F(z°, —00)6(x° — z°) 


8.6 Nonlinear sigma models Aig 


+F (00, 2°)p© (z)F(z°, 2°)O# (x) F (2°, —00)0(z° — 2°) 


(8.70) 
and carrying out the differentiation using (8.66) we get 
6) OG Bc Op? Oy° 
jaa T OB alae (e) of 1) = cr Se oe aed) 
a (nee GBC 5M (q — ee ne 
(8.71) 
We have used the fact that 
OG 
Fae Oe? (2°, 2),0% (0°, 2)] = FACE”? Gray" (2°, 2), 6° (2°, 2) 
OG 
aa dy — = GPP TT, (2° fe), 0° (a ,2)| 
ie a Gas) (8.72) 


The idea now is to take z — z so that the first term on the right-hand side of 
(8.71) is what we want to simplify in (8.67). But this leads to the expression 
6(4)(0) which is the related to the volume of the momentum space. Thus we 
have to have a truncation of modes to do this entirely correctly. But we can 
proceed by writing this expression formally as follows. Notice that pointwise 


OGBc BC _ rs) 
DA oC = a logG (8.73) 


where the trace is over the indices B,C. The quantity log G may be regarded 
as an integral kernel with 


(z| log G|z) = 6 (24 — z) logG (8.74) 


so that by taking a functional trace as well as the trace over the indices B,C, 
we get 

Tr logG = 6“ (0) Tr logG (8.75) 
where Tr denotes the functional trace as well. Using equations (8.71) to (8.75), 
we can now write 


OGBc Op? dyC ed J) 


3 Op4 Oa" Ox, a a a(z)e°(2) ef”) 


Z27L ZL 


so mae 5 (Trlog @) ¢ yel 7) (8.76) 
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Taking the remaining terms out of the expectation value as functional deriva- 
tives with respect to J does not cause problems. The commutators encoun- 
tered along the way involve only y’s and vanish. In particular, we bring out 
the OF, ,(z)y° (z) in the first term on the right-hand side as functional differ- 
ential operators on Z[J] and then carry out the differentiation with respect 
to z* and then take the limit z — «. Using (8.68) and (8.76) in (8.67), we 
get 


3) Oy? 10Ggc 0p? O~° ii 


= 5 ae [J] 
(8.77) 


In terms of the action (8.61) we can express this equation as 
|- 6S os 6 
5pA(@) * 25—4(@) 


The integral of a total functional derivative will vanish with appropriate 
fall-off behavior for large y, so we can write 


—_— Trlog c| Z|J| = -iJa(x) Z[J] (8.78) 


pp 


6 
[leelgoa exp (iS + $TrlogG+ f Jy) =0 (8.79) 
Carrying out the differentiation leads to 


6S tet 6 is+ 4TrlogG Hf 
[ia is, Tele ore Torey E66 + Jala) oo a 
.80 


Bringing out the y’s as differentiation with respect to J’s in this equation, 
we get 


- 6S i 6 
dpA(z) 2 dpA(z) ver 


a / [dyjeS+ aT loeG+fJp gg) 


Faye tt OE ¢| i! ldgle'S+ aTrlogG + f Jy 


Comparison with (8.78) shows that we may solve for Z[.J] as a functional 
integral 


Z[J] =N fide exp (iS + $TrlogG + f Jy) 
= N / [dy|Vdet G exp (is e / Je) (8.82) 


As usual, this result is up to a normalization factor V. 
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On a Riemannian space with metric Gagdy4dy®, the volume element is 
given as Vdet G [] dy. This is indeed the volume that has emerged naturally 
from the above analysis for the functional measure for the sigma model. What 
needs to be done regarding the 6‘4)(0) that we encountered is also clear 
by now. We need to define a suitable regularization procedure to evaluate 
the functional determinant det G and then use it to evaluate the functional 
integral as well in a regulated way. As in the derivation for the y*-theory, the 
ze-term which has to be added to ensure the convergence of the functional 
integral at large values of y“ (and which leads to the choice of Feynman 
contour for propagators) will give the required fall-off behavior for the validity 
of the vanishing of the integral of the total derivative in (8.79). 


8.7 The connected Green’s functions 


In Chapter 5, we saw that the perturbative expansion of the Green’s functions 
and the S-matrix can lead to connected and disconnected Feynman diagrams. 
For example at the second order, the diagram in figure 8.1 is a disconnected 
diagram with two connected pieces while the diagram in figure 8.2 is an 
example of a connected diagram. At higher orders, obviously, we would get 
large numbers of disconnected diagrams. We now show that there is a general 
relationship between the generating functional for Green’s functions or the 
S-matrix functional and the generating functional for connected diagrams. 


Fig 8.1. An example of a disconnected diagram 


The generating functional Z[J] can be expanded as 
i 
Z| J] = Ne N! [tartan S(e)-Sen)Glar sea) (8.83) 
N 


where 


G(21,...2N) =N fidgle* ~(x1)...p(zn) 
= (~(21)...p(zw)) (8.84) 
Nis [ldelen* (8.85) 
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Fig 8.2. An example of a connected diagram 


(We shall use Euclidean Green’s functions for most of what follows.) The 
Green’s functions are averages over the fields with a probability distribu- 
tion given by Ne~S#. Disconnected Green’s functions arise when an avy- 
erage like (y(x1)...p(zw)) factorizes as, for example, into (y(21)...~(xx)) 
((p(r41)---y(an)). Let G° denote connected Green’s functions, viz., G{ is 
the fully connected Green’ function with N fields. A general Green’s function 
can be written as the sums of products of connected Green’s functions. Let 
G(x1,...cy) have n; factors of G§, ng factors of G§, ...,etc. The number of 
ways of factorizing G(x1,...cy) in this fashion is the same as the number 
of ways of partitioning N particles with n; boxes with one particle each, ne 
boxes with two particles each and so on. For convenience of the argument, 
define Q = f d*xJ(x)y(x). The expansion of Z[J] involves the averages of 
products of Q’s. Let Gy denote the average of N Q’s. In the partition of Gy, 
it is clear that the exchange of the nz boxes does not give a new partition; 
also exchange of the k particles in a box does not give a new partition. Thus 
the partition of Gy looks like 


=e sun SET am | (8.86) 


! ! 
nad n1: m2: 
subject to the condition }°,n,k = N. Thus we may write 


Gy = )> NI5(N — ¥> ngk) ars (Gs “ (8.87) 
{nz} 


We now use this factored form in (8.83); the summation over N in (8.83) then 
effectively removes the constraint imposed by 6(N — >> n,k) and we find 


2 (G3)™ (GS/21)"2 (GS/31)"s 
IES 2) i 3/3!)rs 


n3! 
= 55 6D” (5/2) agate 
ny! no! ng! , 


ees exp ($).. 
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= exp (W[J]) (8.88) 


where 


Wl] = Gi + 55 +... 
1 
= [tei@ere) + [ eed! znd 01) J(e2)G°(01, 22) Shick 
1 
= Df [ atr-atews(e1).J(on\G*(er, 0m) (8.89) 
ro 


WJ] is the generating functional for the connected Green’s functions. 

In the above result, we have used the normalized probability distribution 
Ne-=. Therefore there are no purely vacuum diagrams here; they are given 
by f[dple~S* and are removed by the normalization factor. A result similar 
to (8.89) holds for the vacuum diagrams as well. If calculations are done 
at a finite temperature and density, the average [ [dy]e—S# represents the 
statistical partition function. Its dependence on control parameters such as 
temperature and chemical potential are of interest. (The background heat 
bath now plays the role of the vacuum state.) In this case also, there is a 
version of the result (8.89). This is best seen by writing 


Nt = [faple-S* = (e-F) 
= > ayl(-Sine) Yo (8.90) 
N 


where the average is taken with the free action. Just as we considered the 
number of different ways of partitioning the product of Q’s to get (8.87), we 
can now consider the number of different ways of distributing the S;,;’s to 
write, in terms of connected functions, 


1 Sint)°)" ((Sine)°/2!)”? ((Sine)°/3!)”* 
wot S54 (W— Somat) Sal Pst si Ls 


Gay 12: ng: 
_ xr (Sint)°)™ (Sint) / 21) (Sine) /3)"* 
7 2d mt mat gh 
2 fe 
= 09a) (Et). 
= exp (W) (8.91) 


where the superscripts on the angular brackets again denote the connected 
vacuum diagrams with the indicated number of Sjn¢’s. W is the sum of all the 
connected vacuum diagrams. (Again, in the statistical context, by vacuum 
diagrams we mean processes involving scattering between particles in the 
heat bath or other similar thermal fluctuation effects, with no incoming or 
outgoing particles except those in the heat bath.) 
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8.8 The quantum effective action 


Closely related to the generating functional for the connected Green’s func- 
tions is the quantum effective action I'[]. It is the Legendre transform of 
W|J] defined by 


rs) = [ve — WI] eee (8.92) 
where the relation between ® and J is defined by the connecting relations 
or 
6D(x) 12 
éW 
—~=6 8.93 
se (8.93) 


These equations may be regarded as defining I"[®] if we are given W[.J], or 
as defining W|[J] if we are given I'[9]. If W[J] is given, we obtain J as a 
function of the free variable by the second equation in (8.93); substituting 
this into (8.92), we obtain I'[@]. Conversely, given I'[®], we can obtain @ as a 
function of a variable J using the first equation in (8.93), and then use (8.92) 
to define W|J]. One can expand I’[9| in powers of ® as 


Ne? - e 
I|9| = SS N! [andi piss d‘tn &(r1)P(2r2) Sc -P(zn) V (21,22, re) | 
N f 


(8.94) 
The coefficients V (21,22, .-)@n) are easily checked to be vertex functions, 
namely, Green’s functions with the external lines removed or amputated, as 
in (5.25). However, these are actually one-particle irreducible. In the graphical 
representation of vertices and Green’s functions, a diagram is said to be one- 
particle irreducible (1PI) if it does not become disconnected upon cutting 
any single one-particle propagator; diagrams which become disconnected are 
reducible. [| generates all the 1PI-vertices. 

From the connecting relations (8.93), we can write 


4, |_ a Oo 
[ee ana | G(z,y) =6%(@—y) (8.95) 
where 
: ew 
G(2,¥) = saa RIC uI eA (8.96) 


The result (8.95) follows since 6@(x)/dJ(y) and 6J(x)/5@(y) are inverses of 
each other. Notice that we have not set J = 0 in (8.95) or (8.96). By differen- 
tiating the relation (8.95) many times with respect to (x) and using the con- 
necting relations (8.93), one can check that the coefficients V (21, Dawn, ony) 
in the expansion of I’ are indeed the 1PI-vertex functions. 
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We can obtain an equation of motion for I directly from the equation of 
motion for Z[J]. Using the functional integral representation 


Z| =N fide] exp [seo +f J (8.97) 
we have the equation af motion 
EB _, Maa 9 a (8.98) 


In fact, it was from the Minkowski space version of this equation, viz., (5.9) or 
(8.30), that we obtained the functional representation for Z[.J]. In the above 
equation, the left-hand side involves various powers of derivatives with respect 
to J acting on Z[J]. This can be simplified as follows. Using W = log Z in 
(8.93) and differentiating, we get 


] O77 _ bP(21) 
Z65I(x1)6I(e2) 6 (x2) 
= G(r1, £2) + G(x) G(x2) 

= $(x1)$(x2) +1 (8.99) 


+ P(x, )B(x2) 


where 


(a1) = G(x) + (| d‘x2 G(21, 22) (8.100) 


ee. 
6D(x2) 
Differentiating once more, we find 

EE eee aries Cen eee coe 
Z 6J(x1)6J(x2)6J (x3) 


+ G(x2)G(x3, 21) + &(x3)G(x1, £2) 


5G (wa, #3) 
=i) G(1,24) ~ §0(a4) 


= $(21)$(r2) (a3) +1 (8.101) 


where we have also used (8.96). We see that differentiation with respect to J 
may be replaced by the action of ¢. We can therefore write (8.98) as 


oe " FeAl eae 


P er ~ en 
fe z Peon G(z,y) = 6 (a — y) (8.102) 


We have also repeated (8.95). It can be interpreted as defining G(a,y) in 
terms of I. With G(z,y) given by this, the first equation in (8.102) is a 
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nonlinear equation for I"[®]. Given a classical action Sg for the field, we can 
directly set up the equations (8.102). These equations can be considered as 
the fundamental equations for defining the quantum theory of the field. In 
this approach, W|[.J] is a derived quantity given by (8.92) as 


Wi] = [Je - Tia) os 


or 


The second equation is to be solved for J as a function of &. Equations (8.102), 
which may be taken as another definition of the quantum theory of the field, 
are a functional version of what are often referred to as the Schwinger-Dyson 
equations. They can be constructed in an analogous manner for any field 
theory. 

It is useful to write out the Schwinger-Dyson equations for the simple 
example of a y4-theory in some more detail. The Euclidean action is 


a [ae | 50? + sm + rol (8.104) 


For the first of equations (8.102) we find 


or 


BBay 7 KOC) + 4d@*(z) + 1208(2)G(@, 2) 


—4y G(z, 21)G(a, z2)G(a, z3)Va(z1, 22, 23) 


21522;23 


(8.105) 


where K = (—1]+ m?) and we have used the fact that we can write 


5G(x,y) _ " - er 
5(2) ~~ Jan an 0) SB PB m)EB A) OY) 
er G(a, z1)Va(z1, 22, z)G(za,y) 
- . er 
Va(z1, 22, z) — 5B(z1)5D(z2)dB(z) (8.106) 


which follows from the fact that G(x, y) is inverse to 6°T'/5G(y)5P(z), ac- 
cording to the second of equations (8.102). The vertex functions Vs depend 
on &; when @ is set to zero, they become the usual vertex functions of (8.94). 
Likewise, G(x, y) becomes the exact propagator when @ = 0. 

The second derivative of I is given by 


Vo(1,2) = K(1,2) + 12d je(1)? + G(1, 1)| 5(1, 2) 
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—12A8(1) ‘ G(1,3)G(1, 4)Va(3, 4, 2) 
3,4 
+12 [ G(1,3)Vo(3, 4, 2)G(4, 5)G(1, 6)G(1, 7)Va(5, 6, 7) 
Byaeanth 


—4.} 'G(1,3)G(1, 4)G(1, 5)Va(3, 4, 5, 2) (8.107) 
3,4,5 
We use the simplified notation G(1, 2) = G(a1, 22), etc. and K(1,2) = (-Ch+ 
m*)6(x1, £2). When @ is set to zero, the three-point vertices vanish since the 
theory has symmetry under 6 — —@. The second equation in (8.102) can 
then be written as 


il V(1,3)G(3, 2) = 6(1,2) 


V(1,2) = K(1,2) + ¥(1,2) - (8.108) 
Z(1,2) = 12AG(1, 1)4(1, 2) 


si [  G(1,3)G(1,4)G(1,5)V (3, 4,5, 2) 
3,4,5 


The equation for V(1,2) involves higher V’s, the four-point one in this case. 
One can derive equations for the higher vertices by further differentiations 
of equations (8.105) or (8.107). This will involve still higher vertices, leading 
to a whole infinite chain of equations. One needs to truncate them at some 
stage to do useful calculations. For example, for the four-point vertex we find 


V(1,2,3,4) = 4!A 6(1, 2)6(2, 3)6(2, 4) +--- (8.109) 


If we truncate this by keeping only the first term, the term which is explicitly 
shown in (8.109), we get 


Z(1, 2) ¥ 12AG (A, 1)6(1, 2) — 4! 4?G(1, 2)8 (8.110) 


This can be used in (8.108) to get a closed set of equations for the propagator 
G(1, 2). 

The Schwinger-Dyson equations are a set of equations for the exact prop- 
agator and exact vertices. One can generate the perturbation expansion from 
them by expanding around the free propagator. Let Go(zx, y) denote the free 
propagator; the subscript is to emphasize that this is for the free theory. We 
can then convert the equation for G into an integral equation as 


G(1,2) = Go(1,2) — [ Got, 3) 518, 4) (4,2) (8.111) 


This equation, together with the equation for '(1, 2), can be used to generate 
a series expansion for G. If we use the approximation (8.109) and compare 
the resulting series with the standard perturbative expansion for G, we can 
see that the equations (8.108) amount to resummation of an infinite set of 
Feynman diagrams. 
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8.9 The S-matrix in terms of I 


One can extend this point of view, namely, regarding the definition of I" 
directly in terms of the classical action as the way to quantize the theory, to 
the calculation of the scattering matrix as well. We have seen in (5.27) that 
the S-matrix functional may be written as 


Fly] = Z[i(a+m?)y] = ct ale (8.112) 


(There is also a trivial factor e~? JeCG+m*)e which we have not displayed 
since it is not important for what follows.) The S-matrix elements are ob- 
tained by replacing y by the free one-particle wave functions as in (5.24). The 
free one-particle wave functions obey the condition (0+ m)y = 0, or J = 0. 
(This is to be done after the required number of differentiations with respect 
to J or y.) Using the Minkowski space version of (8.92), we may write 


WiJ| = / d*z J® + iT [9 (8.113) 
When J is set; to zero, we have 
or : 
56 - 0 (8.114) 


and W becomes iI’[%] evaluated on solutions of the equation (8.114). In other 
words, from equations (8.112, 8.114), we can write 


F =S=exp (i r{@)) (8.115) 


Cia 
$570 


The S-matrix is given by the quantum effective action evaluated on solu- 
tions of the equation (8.114). This relation gives a nonperturbative definition 
of the S-matrix. The solutions to the equation S = 0 will be parametrized 
by some set of variables; this free data in the solutions are the quantities 
on which S depends. Perturbatively, the free data are the amplitudes ay, a; 
in the solution for y written as yp = 50, apux(x) + azuz(r). Here ay, af are 
viewed as c-number quantities. The amplitudes for specific processes are then 
obtained by differentiating S appropriately with respect to a,, aj and then 
setting them to zero. There are many applications of this definition of the 
S-matrix; for example, one can use it to derive recursion rules for scattering 
amplitudes whereby amplitudes with a certain number of external lines are 
generated recursively from amplitudes with lower numbers of external lines. 

Equation (8.114) can be considered as the effective quantum equation of 
motion, namely, as a c-number equation which nevertheless captures the full 
effect of the quantum dynamics. We see that I" is not only the generating 
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functional for 1PI diagrams, it defines the quantum theory of the field en- 
tirely. By solving the equations of motion (extremization condition for I’), we 
can define the S-matrix. One can also use it for analysis of nonperturbative 
aspects of the theory, for example, for analyzing nontrivial ground state prop- 
erties. In most of the situations we have considered so far, the ground state of 
the sytem was the ground state of the free-field theory. While this is adequate 
for perturbation theory, there are many situations where the ground state is 
modified by the interaction. We will need a nonperturbative analysis to see 
if a new ground state is dynamically chosen or preferable. The calculation 
of I’ (nonperturbatively) and its subsequent extremization can answer this 
question. 


8.10 The loop expansion 


The diagrammatic expansion of the Green’s functions or W[J] leads to a 
diagrammatic expansion of the vertex functions in I’. A systematic expansion 
procedure, which is useful both conceptually and for practical calculations, 
is given by expanding I in powers of h. 

The functional representation for Z[J] is given by 


Zl) =N / pre |-zSe() + / J¢ (8.116) 


This is the same as equation (8.97), but we have now explicitly indicated 
where ft appears. Since the propagator G is the inverse to K where 5 {eke 
is the free part of the action, we see that, with h included, G = AK~1 ~ h. 
From (8.116), we also see that the vertices must go like 1/h. For any term 
in I represented as a Feynman diagram with V; vertices of type 7, J internal 
lines or propagators and FE external lines, we have 


Sui -21=E (8.117) 


where v; is the valence of the vertex of type 7; for example, for an inter- 
action term Ay*, the valence is 4. The E-lines carry E external momenta. 
The J-momenta for the internal lines are constrained by the momentum con- 
servation 6-functions at each vertex. One 6-function simply expresses overall 
conservation of momentum. There are thus }°,; V;—1 6-functions constraining 
the internal momenta. The unconstrained internal momenta are the loop mo- 
menta which are to be integrated over. Thus, for the number of loop momenta 
or loops L, we have 


L=I-()/Vi-1) (8.118) 
i 


The number of powers of fi is given by I— 5°, V;, which is equal to L —1 from 
the above equation. A term with L loops in its diagrammatic representation 
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will go like h’~1. In this way, we see that the quantum effective action has 
an expansion of the form 


-r\o) = y Ae? piel (8.119) 


with the Z = 0 term corresponding to the classical theory; this has no loops. 
Diagrams with no loops are called tree diagrams. The h-expansion is a sys- 
tematic way to classify and analyze the quantum corrections. 

In the expansion (8.119), the term I’ [d] is in fact the classical action 
S(®). We can see this as follows. Using the definition of I in (8.116), with 
the factor of h inserted, we can write 


exo(-ZTIe) = [tae] exp (-75e(0) + f I0e- ®)) 
=N fide] ex (—F5e(0+2) + f Je) 


=N fide exp (-;Se(e +0) +3 / eR) 


(8.120) 


Using the expansion (8.119) and Taylor-expanding in powers of y, we find 


exp(— 3 ne Pig) = N exp(—7Se(8)) x 


L=0 
1 6r® 6Sr 
[tae ost foe 56 -) 
ak at a 
eo. bpiy ) » 


(8.121) 
The first two terms in the h-expansion give 
(8) = Sp() 
2 
r) [go] = = ; log det (sora) 8.122 
ae ig(a)i0W) ime) 


The determinant involved is a functional determinant. For example, for the 
dy4-theory we have 


1 
T= il | (09) + sme - 224 


h 
+ 5 logdet(—O+ m? + 12\$”) + O(h?) (8.123) 
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The identity Tr log A = log det A, which is certainly valid for any diagonaliz- 
able matrix A, and more generally for all matrices since diagonalizable ma- 
trices are dense, can be used to define and evaluate functional determinants. 
This gives 


f me 
5 log det(—O + m? + 12\6?) = ? Trlog(—O + m? + 12\6?) 


h 
7 f atclal log(—C1+ m? + 12\87)|y) 
ye 


(8.124) 


Expansion of this in powers of © will lead to a series of terms which correspond 
to one-loop diagrams with increasing numbers of external lines or &. A general 
formula for the result is difficult because depends on x and one can have 
many derivatives of appearing. If one is interested only in very slowly 
varying fields 4, one can evaluate the determinant explicitly, neglecting all 
derivatives of and treating it as a constant. This would also be the lowest- 
order term in an expansion of the determinant in powers of derivatives acting 
on ®. The term in I" corresponding to constant fields is called the effective 
potential; it was defined and evaluated to one-loop order by Coleman and 
Weinberg. For the one-loop correction to the effective potential, namely, the 
lowest-order term with no derivatives of &, we have 


; fae log(—O. + m? + iy] = 5 | ttt [= (an) z log(k? + s) 
= F(s) (8.125) 


where s = m? + 12A@7. 


os Lat fat of 
mote 


_h ‘ Da ar (8.126) 
ae) (27)? 2/k-k+s 


Integrating, we find 
Pa) teal oa |3 AVk-k+s— she k +m? (8.127) 


The expression for F, apart from the volume integration, is of the form of 
the sum over zero point energies with s in place of m?. We have chosen the 
constant of integration for the s-integration to be the zero-point energy for the 
free theory, so that F(s) is zero without the @*-interaction. The k-integration 
in (8.127) is divergent and the proper way to handle this is to evaluate it with 
a cutoff on the momentum and absorb potentially divergent terms into various 
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parameters of the theory. This is the procedure of renormalization discussed 
in the next chapter. For now, we evaluate it with a cutoff, or an upper limit 
for |k|, denoted by A and obtain 


Rigi 


2 2 
4 aya tee Bee 
307 ns f dtc pne (4 ag oa log(4A /m ) 
ih 
+(12\67)? € = log(4.A? /m?)) 


— = (m? + 126)? log ( | 


(8.128) 


For the sake of completeness, we will give here the renormalized form of the 
effective action, although details are discussed only later. To one-loop order, 
I is given by 


r= / oe” = 5m 4 204 
12,6? 


= | (m’ + 12\7)? log ( =| =) 2160%p'| 


647 
(8.129) 


As mentioned at the end of the last section, extremization of I’ can define 
the quantum theory. In (8.129), we have done a one-loop evaluation of I” 
for slowly varying fields. Notice that when we set the variation of I” to zero, 
the source J is zero and hence @ is identical to (0|¢|0); it is the vacuum (or 
ground state) expectation value. If the ground state is translationally invari- 
ant, then the expectation value is a constant. Thus, the effective potential 
approximation, where gradients of # are neglected, is adequate to analyze the 
ground state expectation value of ¢ in the full theory, if we assume that the 
vacuum is translationally invariant. Our evaluation of the effective potential 
to one-loop order can be used as a first approximation in this endeavor. 
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9 Renormalization 


9.1 The general procedure of renormalization 


We have already seen that the effective action I can be expanded in powers 
Chinese! = >* , h’ Pr) where I“) generates the one-particle irreducible 
(1PI) or proper vertices with L loops in the Feynman diagram. The loop 
integrations correspond to the fact that the interaction can induce virtual 
transitions to various intermediate states. Alternatively they may be thought 
of as the interactions of the incoming fields with the quantum fluctuations 
of the field in the vacuum (which must exist since the field and its conjugate 
momentum do not commute). If the loop-momenta are allowed to become 
arbitrarily large, which is to say that if transitions to virtual states of arbi- 
trarily large momenta can occur, some of the integrals can and do diverge. 
In effect, this means that the field theories we are discussing, with point-like 
interactions at short distances, are inadequate as descriptions of the physical 
world at very high momenta or at very short distances. We must consider 
these theories as valid only for momenta less than some very large value A. 
All loop-integrations are to be cut off in some fashion at this value A. The 
specific procedure for introducing a high momentum (or ultraviolet) cut-off 
in the theory is called a regulator or a regularization procedure. The resulting 
theory, which has an ultraviolet cut-off, and hence no divergent integrals, is 
called a regularized theory. The aim of quantum field theory is to provide a 
description of physical phenomena in terms of such regularized theories. 

In using quantum field theory for practical calculations, we must therefore 
take account of the following points. 


1. First of all, one needs a regulator which makes all loop integrals mathe- 
matically welldefined and finite. The calculated results with a regulator 
will depend on the cut-off A and hence on the specific regulator used. 
Unless we have a good reason to choose a specific regulator, this would 
lead to some ambiguity in the predictions of the theory even after a La- 
grangian is chosen. 

2. The calculated results such as the S-matrix will also depend on the pa- 
rameters such as the coupling constants (generically denoted Ao) and 
masses (denoted mo) which appear in the Lagrangian. (These parameters 
in the Lagrangian are often called the bare parameters.) The physically 
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measured couplings and masses are not the parameters in the Lagrangian, 
since there are, in general, corrections to them due to interactions. The 
measured couplings and masses are thus functions of the bare couplings 
and masses and the cut-off A. These are calculable functions once a regu- 
lator is chosen. In interpreting the calculated results such as the S-matrix 
or the effective action [ and in comparing them with experiments, we 
have to rewrite them in terms of the actual measured parameters \ and 
m. 


The two issues above are related. The idea of renormalization is that one 
can absorb all the A-dependence or regulator dependence of the calculated 
results into the relation between the measured parameters A, m and the bare 
parameters \9, Mo. In other words, the calculated results, when expressed in 
terms of the physically measured parameters, do not depend on the choice of 
the regulator. Thus after computation of the loop corrections and rewriting 
everything in terms of the measured parameters, we end up with the unam- 
biguous predictions of the theory. The measured parameters A, m are often 
called the renormalized parameters. 

The transformation of the parameters can be done at the level of the 
starting Lagrangian itself. For example, for the scalar field theory with quartic 
interaction, we can write 


if 1 

L= E (Ox)? + 30 | + Xo x4 (9.1) 
i 1 

= £3 5)? no 5 (rm = sm?) | +Z,2X yp (9.2) 


The transformation between the bare and renormalized quantities is explicitly 
given by 


Ae Za 
m2 =m? — dm? 
x=vV23 9 (9.3) 


Z,, Z3 and 6m? are functions of A and the renormalized parameters A, m. It 
will become clear that a transformation of the fields will also be necessary; 
this is the reason for the factor Z3. (This may be thought of as arising from 
corrections to the canonical structure of the theory.) The quantities Z;, Z3 
and dm? are called the renormalization constants. 

Since corrections arise due to loop diagrams (which carry powers of h), 
we can write 


co 
A=l+ Yona 
1 


lo. 9) 
23=1 + dong” 
1 
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dm? = _ h¥ (6m?) ) (9.4) 
1 


The renormalized parameters are, by definition, the measured values of 
the coupling constants and masses. Therefore, there are no further corrections 
to them. This means that the renormalization constants must be such that 
they cancel out any corrections which may arise from the loop calculations. 
The strategy for perturbative calculations is then the following. We start with 
the Lagrangian (9.2) and calculate I'[$] with some regulator. 6m?, Z; and 
Z3 are then chosen so that in I"[] the mass is m?, the $4-coupling is \ and 
the normalization of the kinetic energy term is 1. [6] should then have no 
terms which diverge as A becomes very large. The specific value of A is then 
immaterial except that it should be large enough so that terms of order 1/A 
can be ignored. From I'[@] one can obtain W[J], the generating functional 
for the connected Green’s functions by the Legendre transformation (8.103). 
The S-matrix can then be constructed from this. (What we have described 
is one ‘scheme’ of renormalization. There is some freedom of A-independent 
redefinitions in relating the renormalized parameters to experimental mea- 
surements, leading to other ‘schemes’. This is briefly discussed in the next 
section. Such schemes can be useful in some contexts, for example, when we 
have massless particles.) 

We shall now work through the implementation of these ideas to one- 
loop order in the scalar field theory with quartic interaction. There are still 
many more features of this renormalization procedure which require further 
discussion, but we shall do that at a later stage. 


9.2 One-loop renormalization for scalar field theory 
Since we are not interested in vacuum diagrams, at least not at this point, 


the simplest one-loop correction we can calculate corresponds to two external 
lines and is given by the following diagram. 


Fig 9.1. Scalar self-energy 
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The 1PI-diagrams and the corresponding vertex functions can be obtained 
from the general expression for the S-matrix functional given in (5.18). The 
Euclidean version of this is 


niet fobt) af-zale] os 


The mathematical expression corresponding to the Feynman diagram given 
above will have one power of the coupling constant and one Wick contraction. 
It is thus given by the term 


nan (2fobe) Laafe 
= —62,N f ate G(x, x)y? (2) (9.6) 


The propagator is given by 
1 d*k etk -(x—y) 


C= 7} Galt Fmt bn? On 


For this calculation, we will get one power of h from the propagator. Thus, 
to get the O(h)-term in the effective action, we only need Z;,Z3 and 6m? 


to O(h°) in the propagator and in (9.6); ie., we can take Z; ~ 1, Z3 ~ 
1, 6m? = 0 on the right hand side. The term corresponding to (9.6) in I'[$] 


is then As 
(dys Pp A 2 ( 
I; = ia f 2, (Qn)! p 5 reed foes xr ~P*( (9.8) 


If the integration over the loop momentum p in this expression is unrestricted 
in range, we see that this integral will be quadratically divergent. We must 
interpret the theory as having a cut-off for the momentum integration, p? < 
A?. We can evaluate the integral easily with this cut-off. Notice that the 
integral is spherically symmetric in four-dimensional p-space and so can be 
evaluated using spherical coordinates in p-space. We then find 


AZ 
ne 12 | 
e 1672 0 
3A A? 1 
i 42 [a — m? log (1 ae =) | pes 5” (2) (9.9) 


where we have used the fact that 


2 59 (2) 


/ d*p = 2n*p dp = ns ds (9.10) 
angles 


with s = p?. 
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We now have to include this term in I" and identify 5m”. This will be 
done after evaluating the correction to the four-point vertex also. 

The one-loop correction to the y*-interaction can be represented by the 
Feynman diagram shown in figure 9.2. The mathematical expression for the 
this diagram must have two powers of \ and two Wick contractions. The 
relevant term in F[y] is then 


ma ee eb slain 0 > Oi ae P 
sl dia Sor 5° amiss [xt m1 [eo few 


=Vor | P@oewFW) (9.11) 


Fig 9.2. One-loop correction to y*-interaction 


Once again, we will get one power of f from each of the propagators and so, 
as before, we can set Z} +1, Z3 +1, 6m? ~ 0 in evaluating this. 

Using (9.7) for the propagators with Z, ~ Z3 ~ 1, dm? = 0, and with a 
change of variables, we get 


1 = f dedty 8(@) Vie») #W) (9.12) 
where 
4, 
Vie) = ee V(k) 
= ee ee as 


(20)* (p? + m?)[(k — p)* + m?] 


The behaviour of the integrand at large p shows that this integral is logarith- 
mically divergent. We evaluate V(k) with a cut-off A as before. By using the 
Feynman integral representation 


1 ; 1 
ae ere Ce -_ 


we can combine the denominators of the two propagators to get 
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d4p 1 : 1 
vin = (00? | oe f, Tp emt 


d+p f? 1 
wi) an | “(pa ae STE coal) 


By a shift of variables p > p+ kv, we see that the integrad will be spherically 
symmetric in the four-dimensional p-space. The corrections due to the change 
of the limits of integration will be negligible if the momentum k is small 
compared to A. In this case we have 


(6X) 
~ 167? [a of” a [s+ ae —v)+m?]? 


--i heen) | Sle 


We have used (9.10) again. Since we have a parameter with the dimensions 
of mass in the theory, it is convenient to split the logarithm and write this as 


V(k) =V(0) + aan dv log +o”) 


V(0) = 5 ; [io og (#) -1| | (9.17) 


We now turn to the choice of the renormalization constants. The effec- 
tive action, with the one-loop corrections to the 6?-vertex and the }4-vertex 
included is 


V(k) = 


1 1 
Pe [2 | ;(00) + 5(m? — sma] + aa | o ify” aie? 4 
(9.18) 


The term which corresponds to the $?-vertex is given, to first order in h, 
by 


1 
Ip = / (00) + sme] +hZs” / 5009) - sme| 


+n{— — (5m?) + a [A? =m? log (+5)]} / ge 


(9.19) 


The requirement that the normalization of the kinetic energy term should be 
1 gives ZY = = 0. The mass is given by m? if we choose 


3A ne 
(5m?)Q) = = [a — m? log (1 ag =) (9.20) 


The &?-term in the effective action is then 
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if 
Ih= i. 3 [(O6)? + m?6?] (9.21) 
The term which corresponds to the *-vertex reads, to first order in h, 


y= » [ a. +h (rz{? bs v(0)) J 


ss f,ealf ther re 
V*(k) = EE i ae (1+ ean) (9.22) 


The basic strategy of renormalization is to choose Z0) such that the A- 
dependence of the effective action is canceled out. In the present case, we can 
choose AZ 3) +V(0) = 0. The *-term of the effective action is thus 
4 2 d*k ik-(x—y) * 2 
ry 


The A-dependence is eliminated by choosing the renormalization constant Z; 


to first order in fh as 
2 
Z,=1+ hoa a 7, [log (= ) - 1 (9.24) 


With this choice, the 64-term of the effective action can be written as 


ee " Iii ng uO ee en 
V(a1,22,03,04) = -| Maeve ethers (2/852) Ths) Valk) 


Valkg) = A + a dv log (1  Aecataaiaaet D2 pala — 2) 
(9.25) 


We kept factors of h up to this point to see how the renormalization constants 
cancel out the A-dependence in a systematic expansion. From now h will be 
set to 1 again. 

There are certain ambiguities in the way we have separated out the A- 
dependent part and the “finite” part (the part which is finite as A — oo). 
For example, we could split the logarithm in (9.17) using an arbitrary mass 
scale y to obtain 

2 pl 2 Pe ome 
9 es (S Be ie a *)) 
0 BL 


V(0) = = og (4) - i oes (9.26) 
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We also have a similar ambiguity in choosing Z1; for example, we could choose 
\Z +. V(0) = cd. The resulting V*(k) and Z; would be 


2 2 2 -_ 
V*(k) = e aan degiaies (3 an =) ) 
pe B 
2 

seater 2 fag(S) =i) (9.27) 
Notice that c can actually be absorbed into the definition of js, so that there 
is only a one-parameter ambiguity corresponding to y. The p-dependence 
of V*(k), after eliminating Z) and V(0), has to do with the meaning 
of the renormalized coupling constant. The coupling constant is a way of 
parametrizing the strength of the interaction and has to be determined via 
some scattering experiment. What is measured is the scattering amplitude 
at some momenta for the incoming particles. We can pick one value of these 
momenta and define the corresponding amplitude as the coupling constant. 
The scattering amplitudes at other momenta are then parametrized by this 
constant. Because of this y-dependence, strictly speaking, we must write A(2) 
for the renormalized coupling constant. In our first way of defining the renor- 
malization constant via equations (9.22, 9.23, 9.24), which corresponds to the 
special choice u = m, we see that V*(k) — 0 as k — 0. As a result, for the 
four-point vertex V4, we find 


Va(ki, ka, k3, k4) fe =x (9.28) 
We can identify \ as the value of the four-point vertex function (or the four- 
particle scattering amplitude, apart from trivial kinematical factors) when 
the momenta of the particles involved goes to zero. This is the \ we have 
used in the action; it can be further specified as \(m). In our second way of 
defining Z,, we find 


Va(ki) es = 2) + = 5 log (5) 
= Au) (9.29) 


We see that the second choice corresponds to a different definition of the 
coupling constant. Physical quantities will be independent of this ambiguity. 
This is easily seen in the present case by writing the various expressions in 
terms of the physical amplitudes at chosen momenta. First we write V*(k) 


as 
a 2 2 = 
V*(k) = “ff dv 1g (S 5 *)) 
pe y 


m? k?y(1 — v) 
=e) log (=) + aa dv log (1 + a) (9.30) 
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We can then write V4 as 


i 
V4 (ki) = V4 (0) oF — i. dv log (: == wu) qos (9.31) 
0 


where V4(0) denotes V4(k;) at zero momentum for the particles involved and 
k = ki + kg. We have eliminated X in favor of V4(0), to the order we have 
calculated. Equation (9.31) expresses the true prediction of the theory, giving 
V4(k;) in terms of its value at some fixed choice of momenta, k; = 0 in this 
case. In a massive theory, 4 = m is a natural and convenient choice and we 
shall use this for the y*-theory from now on. 

It is interesting to examine the low- and high-energy behavior of the 
effective action. By momentum conservation at the vertex, k = k, + ke, so 
that, for low-energy processes, we can consider the expansion of V*(k) in 
powers of k/m. We find 


ay 
822 m? 


V*(k) = (9.32) 
This shows that the first correction to the low-energy result, expressed in 
coordinate space, is a term of the form (0,,y")(0"¢y"). 

In the high energy limit we find 


: 9? k? 
V*(k) = ia log (S) (9.33) 


The corresponding scattering amplitude increases logarithmically with k?. 

The calculations given above illustrate the renormalization procedure. 
We see that, at least as far as the @°- and +-vertices are concerned, all 
A-dependence has completely disappeared and we have well-defined one-loop 
corrections in terms of the experimentally measured parameters. To complete 
the renormalization procedure to the one-loop order, we must consider higher 
vertices also, for example, the °-vertex given by the diagram 


a 
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Fig 9.3. One-loop correction to the 6-point vertex function 
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In this case we have three propagators and one loop momentum p, so that 
the integral at large p has the form 


1 
I~ iA d* of (9.34) 


There is no divergence at large A; the result is finite‘ plus terms which are 
of order as. Vertices with higher number of ®’s are also finite as A becomes 
arbitrarily large. These vertices can all be calculated without additional re- 
strictions or without introducing new renormalization constants. (Such ver- 
tices give further unambiguous and testable predictions about various types 
of processes.) We have thus carried out the renormalization of the theory to 
one-loop order. 

The renormalization constant Z3 is equal to 1 up to one-loop order. The 
correction to Z3 starts at the two-loop order and arises from the diagram 
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by \ 
See @-----@----- 
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Fig 9.4. Two-loop correction to two-point function 


This can also give two-loop mass corrections. There are also two-loop correc- 
tions to the *-vertex from diagrams such as those in figure 9.5. 
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Fig 9.5. Two-loop corrections to the 4-point vertex function 


We will not do a systematic calculation of the renormalization constants to 


two-loop order but will calculate ge? to illustrate how this can arise. The 
*-term corresponding to the two-loop diagram given is 


1?) = ‘ (2) V(,y) H(y) 


V(z,y) = —48\°G*(z, y) 


4 
= i are V(k) 


d*p dt* 1 
Va ~48)? [ - 4 2 2\(q2 2 2 2 
(27)* (2m)* (p? + m?)(q? + m?)[(p + g — k)? + m?] 
(9.35) 
The evaluation of the integral is rather involved, but the leading A-dependent 


term, which is the term relevant for 73, can be obtained without difficulty. 
The result is 


A2 


32 
Pr == | gz) F= log (=) 6 (2 —y) — F(z, | dG(y) 
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Ok gic y[3¥ (2 
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The terms in I which are of order fi? and involve two powers of © are 
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(9.37) 


Lid Thad, Tt ah) 
There are also two-loop corrections to the mass which we have not displayed 
above. From the coefficient of the term $(0®)*, we see that the choice of Z3 
which gives a A-independent I" is 


2 2 
Zs = 1- = hog (4) 5 (9.38) 


9.3 The renormalized effective potential 


In the last chapter, we calculated the one-loop correction to the effective 
potential. The effective action, with this term added, was 


r= I 23 [5097 + sm = a, + Z,\$* + F(S) 
F(®) = aa =a d‘x pane (+5 + a om Te toga? /m *)) 
+(12A¢7)? G s ccna 


+= = (m? + 12,0?)1og | 


(9.39) 


We have put in the Z-factors and 5m”. Renormalization can now be carried 
out by choosing these constants to cancel the potential divergences. When 
this is done, we get the Coleman-Weinberg potential as 


r= / aoe" + sme + 224 


12\¢? 


as Low +1267)? lo ( 


) — 12\m?6? — -216x6"| 
(9.40) 


where 


3A = 
6m? = er) 7(4+5 els 1 5 log(44?/m?) + 


Z,=1- a (1 - 5 log (4d? [m?)) 4... (9.41) 
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These renormalization constants differ from the set (9.20), (9.24) because 
of differences of regularization; notice, however, that the leading divergent 
terms have the same coefficients in (9.41), (9.20) and (9.24). 


9.4 Power-counting rules 


The renormalization procedure requires the introduction of the renormaliza- 
tion constants and then choosing them appropriately. Some of the questions 
which naturally arise at this stage are the following. Which terms or Feynman 
diagrams are potentially divergent and need regularization? How many renor- 
malization constants do we need? How do we extend the one-loop procedure 
systematically to higher loops? The answer to these questions will require a 
more systematic analysis of the possible divergences of Feynman diagrams. 
Divergences arise from loop integrations. Propagators, since they behave like 
1/k? for bosons at high momenta and 1/k for fermions, can improve the con- 
vergence of the integral. We define the superficial degree of divergence 6 of a 
1PI-Feynman diagram with loop integrations as the number of positive pow- 
ers of loop momenta minus the number of negative powers of loop momenta. 
In other words, each loop integration contributes +4 to 6; each propagator 
which carries a loop-momentum variable gives —2 if it is a bosonic propagator 
and —1 if it is a fermionic propagator. The superficial degree of divergence 
gives the highest powers of A which can arise from the evaluation of the inte- 
gral with a cut-off A. It tells us which diagrams are potentially divergent. For 
a general Feynman diagram, for the y*-interaction, let Eg be the number of 
(bosonic) external lines and Ig be the number of (bosonic) internal lines. If 
V denotes the number of vertices, since each vertex has valency 4, we have 
Eg = 4 V — 2Ig. With h for each propagator and 1/h for each vertex, the 
number of powers of h which must be equal to the number of loops L is given 
by L —1 = Ig —V. For a one-particle irreducible diagram, 6 is given by 
4. — 2Ip so that 


§ =A(Ip —-V +1) -2Ip =4—Ep (9.42) 


6 is determined by the number of external lines only and so there are only 
a finite number of diagrams at a given loop order which can be potentially 
divergent, namely, diagrams with Eg < 4. Corrections to the y?-term have 
§ = 2 since there are two external lines. The leading divergence is a quadratic 
divergence. There can be a subleading divergence which is logarithmic and 
which must carry two powers of external momenta for dimensional reasons. 
This would correspond to a contribution like (Oy)? appearing with a coeffi- 
cient which behaves like log A. We will therefore need a renormalization of 
the field as well as a mass renormalization. Corrections to y* have 56 = 0 
corresponding to a logarithmic divergence and will need an additional renor- 
malization constant corresponding to coupling constant renormalization. In 
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this theory we will not need any further renormalizations, since this exhausts 
all possible cases of 6 > 0. (Since the theory has symmetry under y — —9, 
terms odd in y do not appear in the perturbation theory.) 

The superficial degree of divergence 6 can be related to the dimension of 
the corresponding monomial of fields. We assign a dimension to the field y. 
The action has to be dimensionless because, among,other reasons, it is in 
the exponent of the functional integral. Since the kinetic term is f d*z(Oy)?, 
assigning a mass dimension of —1 to the coordinate z and requiring that the 
action be dimensionless, we find that the dimension of y must be 1. The di- 
mensions of the coupling constants are determined from this so that all terms 
in the action remain dimensionless. (Equivalently, the Lagrangian must have 
dimension equal to 4.) The effective action is also dimensionless and we can 
use this to determine the dimensions of the coefficients of various possible 
terms in I’. Thus, for example, the term [ d*x y? has dimension —2; the 
coefficient must have dimension 2. Since all other dimensionful parameters 
can be ignored compared to the cut-off A, we can conclude that the coef- 
ficient of this term must go like A?. In other words, we expect a quadratic 
divergence for this term. Similarly, for the term {(Oy)? the coefficient must 
be dimensionless; it has to be a logarithmic divergence. The coefficient of the 
y*-term can have logarithmic divergences by the same reasoning. Notice that 
in all these cases, 6 is essentially 4 minus the dimension of the number of y’s 
involved. 

While it can help to identify the potential divergences and the kind of 
renormalizations needed, the superficial degree of divergence is not the whole 
story. The actual degree of divergence may be different. A superficially con- 
vergent diagram may have subdivergences as in the following case, fig 9.6. 


Fig 9.6. A two-loop correction to 6-point vertex function 


One may also get entanglement or overlap of loop-integrations. It may happen 
that for one of the loop-integrations, the other loop momenta are to be treated 
as external momenta. Then, depending on how the loop-integration is done, 
the degree of divergence for the next set of loop-integrations may be different. 
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The way to deal with this in perturbation theory is to develop a systematic 
recursive procedure, based on the power-counting rules, with a subtraction 
procedure for separating out and eliminating the divergences. We shall take 
up this question later, after we consider the one-loop renormalization of QED. 


9.5 One-loop renormalization of QED 


The Euclidean action for QED was given in (8.58) as 


Se(Adv) = f ats E (OA 0, AL) OG, (CemieA) ana) ¥| 


= [ate [FFP + 5(0-A)? +0 (1: (O-ied) +m) 


(9.43) 


The free part of the action shows that 7 and w are fields of dimension 3 while 
the photon field has dimension 1. The interaction Lagrangian is of dimension 
4; e has no dimension. £;,,; shows that there are two fermions and one photon 
line at each vertex. For a Feynman diagram with Er external fermion lines, 
Fg external photon lines, J- fermion internal lines or propagators, Ip photon 
propagators and V vertices, we have 


Ep =2(V ~ Ip) 
Ep =V —2Iz 
L-l=Ir+Iip-V (9.44) 


The superficial degree of divergence is given by 


6=4L—Ip —2Ip 
=4 — = Ep — Ep (9.45) 
Notice that 6 is again 4 minus the dimension of the fields involved. 

The potentially divergent diagrams, which correspond to 6 > 0, are thus 
the following. 

1 d=4, bp — Ep = 0. 

These purely vacuum diagrams will be canceled by the normalization of 
the functional integral or equivalently by the requirement that (0|0) = 1. 
(If we have a nontrivial background such as a gravitational field, diagrams 
with no external lines can be important. The relative vacuum contribution of 
different backgrounds is what is relevant and this is obtained by comparison 
of the vacuum diagrams with the backgrounds involved.) 


148 9 Renormalization 


2)6=3,Er=0, Eg =1 

Diagrams with an odd number of external photon lines will vanish due 
to the charge conjugation invariance of QED; we shall discuss this symmetry 
later. 


3) 0 = 2380. eee sae 

These correspond to corrections to the photon propagator and are called 
vacuum polarization diagrams (or photon self-energy diagrams). Naively, the 
degree of divergence is 2, suggesting a quadratic divergence. A local pho- 
ton mass term of the form A,{z)A“(zx) is not allowed by a combination of 
Lorentz invariance and gauge invariance. As a result, if we use a regulator 
which is gauge- and Lorentz-invariant, there will be no quadratic divergence. 
The lowest-dimension term which is gauge- and Lorentz-invariant is FF”, 
which is of dimension 4. The coefficient being dimensionless, we expect only 
a logarithmic divergence for the vacuum polarization. 

Although the (0 - A)?-term breaks gauge invariance, these diagrams have 
fermion loops and, since the fermion part of the action has gauge invariance, 
a gauge-invariant regulator can be used. 


4) 6=1,Ep =0, Ep =3 


These diagrams vanish by charge conjugation invariance. 


5)6=1,Er=2, Ep =0 


These correspond to corrections to the fermion propagator and are called 
fermion self-energy diagrams. Naively, they are linearly divergent. But ac- 
tually, because of the properties of Dirac y-matrices, the divergence is only 
logarithmic. The theory with zero mass has a symmetry, the so-called chiral 
symmetry. The linear divergence, which would correspond to a mass term 
w(x)%(x), has to be zero to respect this symmetry. As a result, even in the 
massive theory, this term has to have a factor of m in its coefficient. The 
remainder is of zero dimension or at most a logarithmic divergence. 


Hor — 0; bes 

This describes photon-photon scattering; the corresponding lowest-dimension 
monomial in the effective action is A,,A“A,A”. Such a term is disallowed by 
gauge invariance, the lowest allowed term has four factors of F,,,, giving a di- 
mension equal to 8. The coefficient has dimension —4, and hence this diagram 
is convergent if evaluated in a gauge-invariant manner. 


7) 6=0, Ep = 2, i al 


These correspond to corrections to the basic vertex ewy*pAy and are 
called vertex corrections and ultimately are part of the charge renormaliza- 
tion. The divergence is logarithmic. 

The discussion given above shows that there are three types of diagrams, 
namely, the photon and fermion self-energy diagrams and the vertex correc- 
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tion diagrams, which we must consider to work out the renormalization of 
QED. The photon self-energy correction can lead to a term like Fy?" in 
the effective action with a logarithmically divergent coefficient. The fermion 
self-energy can give terms like my7 and wy: Ov, both with logarithmically 
divergent coefficients. The vertex correction is again logarithmically divergent 
and gives a term of the form ~y4wWA,. We will need to introduce renormal- 
ization constants corresponding to these monomials. Since they are of the 
form of the terms in the action, this amounts to modifying the coefficients of 
the terms in the Lagrangian. 

Interpreting the parameters in (9.43) as the bare parameters, we can 
rewrite it, in terms of the renormalized parameters, as 


ih, A 
Ne aa aF 3 


A (0- A)? + Zo (y-O+m— dm) 


SHON Dp " de 


—ieZ wy wAp (9.46) 
The renormalization constants Z;, Z2, Z3, 4,dm have the h-expansion 
Aa=t+ Sn 
1 
[e.@) 
A=1+ ) AN) 
1 
oo 
dm = hb’ (5m) (9.47) 
1 


fone 1, 253: 

We now turn to the calculation of these constants to the first order in 
h. This requires the evaluation of the one-loop contributions to the elec- 
tron (fermion) self-energy, the vacuum polarization and the vertex correction. 
(Once again, hf will be set to 1 in what follows, the terms of the same order 
in fi will be collected together in the effective action.) 


Electron self — energy 


The electron self-energy diagram is given by 
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Fig 9.7. Electron self-energy 


The corresponding contribution to I’ is given by 


Gee. 4,94, ak cis eilk+p)-(x— v5 =? : é 


= P(x) E(a, y)b(y) (9.48) 
where 


d'p_ ip (e-v) 
ey el a 7) 
Elev) = | Gave () 
os ie) ta 1 

(om\4 £2 |) ee 
(27)4 k m+iy-(p—k) 
We have set Z; +1, 11, 5m = 0 in the propagators and vertices in (9.49) 
since this is adequate to the order we are calculating. Combining denomina- 


tors using the formula (9.14) and making a shift of the variable of integration, 
we find 


= 3 i 2m(1+v) 
3i(p) = ae d [Ss (2n)4 ae 24 p2v(1 — v) + mv)? 


+(m+iy- 


(ny) = E> (9.49) 


2(1—v) | 


”) P+ Poll — 0) + me 
(9.50) 


(As before, the change of the limits of integration gives a negligible correction 
if the cut-off A is very large compared to the momenta p.) The integral can 
be explicitly evaluated as 


ic = 1421 
On? [2S ped—o)taeoe Tie | eee ee 


m2y2 
ae ee + K vil - 5)| 
(9.51) 
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where K = p? + m?. The self-energy contribution is now 


Di) | = (m+ iy: p)X2 + 3”* 


4r m? 2 
a (N\ 
0 = 
E log (=a) + OR 
1 myo 
‘ am m*v* + K v(1—v) 
ge ee dv(1+v)1 a 
2x Jo piety) of ( m?y? ) 


—(m+iy-p) fae (1 ~v) log (AEC) 


my? 
(9.52) 
where a = e?/4r. (It is the fine-structure constant.) Notice that K, continued 


to Minkowski space, will vanish for free electrons of mass m; the last term 


+/* will thus vanish for free electrons. The contribution to I’ can be written 
as 


ny =f im by +52 d(y-O+m)y] +f dor, y)bty) (9.53) 


Vacuum polarization 


The vacuum polarization or photon self-energy is given by the diagram 


Fig 9.8. Vacuum polarization or photon self-energy 


The corresponding mathematical expression is 


r= 3 f At(@\tulewA’w) 
atk 
Gaye 
d*p Tr[y(m—iy: (p+ k)) v(m — #7 -p)] 
Twlt)=-€ | Ge Geena E RPM OM 


Typ a eM IT Tk) 
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The main difficulty in evaluating this expression is the question of the cut- 
off. The leading, A-dependent term in [4,4 behaves like A?A“(z)A,(z), a 
gauge-noninvariant mass like term for the photon. The reason for this is that 
a cut-off for the range of momentum integration is not gauge-invariant. One 
possibility is to introduce a gauge-invariant regulator which is a little more 
sophisticated than just a cut-off for the range of momeptym. Dimensional reg- 
ularization is one such regulator where the theory is first defined in spacetime 
dimension 4 — ¢, « < 1, by analytic continuation. This isolates the potential 
divergences as poles in e. The limit « — 0 may be taken after the poles have 
been canceled by choice of the renormalization constants. This technique pre- 
serves gauge-invariance at all stages. Another, more simple-minded approach 
is to introduce one more renormalization constant, adding a term 56? A? to 
the starting Lagrangian. A cut-off procedure can then be used, keeping the 
A? A2-term generated by the vacuum polarization. At the end, we choose dp? 
so as to have gauge-invariance for J’. In effect, this amounts to dropping the 
gauge-noninvariant part of IJ,,, in (9.54). We shall follow this approach here. 
(Since fermion action is gauge-invariant, it is completely consistent to require 
gauge invariance in the evaluation of this diagram.) 

Gauge-invariance requires invariance under A,, — A, + 0,6, where @ is 
an arbitrary function of the spacetime variables. If I7,,, obeys k#II,,(k) = 0 
in other words, if it is transverse to k¥, this symmetry holds for the vacuum 
polarization contribution. We shall therefore separate out the transverse part 
of IT,,, and set the nontransverse part to zero and then use a cut-off to eval- 
uate the integral. Taking the trace of the y-matrices in (9.54) and combining 
the denominators, we can write 


+ 497) buy + v(1 — v)(2k,k, — k70;0) 
Bat) = 46 | ao | To 


(9.55) 


where q = p+kv and we have done a shift of the variable of integration. Terms 
odd in q have been dropped since they give zero upon angular integration. 
We have also used the fact that the integral of g,g, is proportional to 6 
iO. 


pV) 


d*q 6 d*q 
F(¢2 ee 2) 2 
| ope FO) aa =e ft PP (9.56) 
for a function of g*. The transverse part of [7 pv iS given by 
k“kP IT, 
Tp (k) = Hay — by" 
1—v) 
tute [oto f q cagee Nee 
= (FO ) (Qn)! [q? + m2 + k2v(1 — v)|? 


(9.57) 


The evaluation of the integral is now straightforward and gives 
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IIT, (k) =(k®S yy — kukv) é (10g (4) : 1) 
ia 7 fay v(1 — v) log co) 


(9.58) 
Using this expression, we find the contribution to I as 
1 1 Ae 
r=} fr [e (I) 
ek - * 
+i f Bute f gyro re] Pw 
(9.59) 


where 


Pie(k) = = ie dv v(1—v) log (1 He we (9.60) 


The vertex correction 


The vertex correction is described by the diagram shown in figure 9.9. 
The contribution to [ may be written as 


rh, = ie | HoT. 2. WAM) 
d*p dép! 
(On) (any 
— dk alm —iy-(p—k)ylm— ty: — ky 
Fale) =—e" J Compe REl(p RP ml = BE 

(9.61) 


T(z, z,y) = el Be!’ Ye-iP-v)-# L,,(p, P’) 


Fig 9.9. One-loop vertex correction 
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We will evaluate only the leading A-dependent term in (9.61). For this, since 
the high k-values are the important regime, the external momenta p, p’ can 
be taken to be small compared to k. We then find 


dh YaVahaicak 
we? ee 9.62 
Tees = fee k2(k2 + m2)? ( ) 


The algebra of the Dirac y-matrices gives yay - kypy > ky*~ = 27k? —4kykyy”. 
Further, since the integral involves only k?, we can use (9.56). Equation (9.62) 
for I, now becomes 

d*k 1 A 
(27)4 (k? + m2)? 


a A 
= Ve re log Fz ee 2 


a 2 
Iu log (<3) fore (9.63) 


In =p e7 


The A-dependent part of the ~)Aw-vertex in I’ is thus 


a A2 
ry, y = Wie i, ay» Ab E log (=3)| (9.64) 


We are now in a position to collect the results together. The effective 
action, to first order in h, can be written as 


if 1 = es 
P= [FFP +5(0- AP + 8(y-0+m)y ~ ied eA, 


1 a A? 1 
1 ees 7 alg. ay 
[tprer zs + aE (ios ( i) 1)| +. 5(9 A) 


a Ae = 
oe [zs Ae log (=) a5 = w(y-O+m)y 


+ A —independent terms + --- (9.65) 


The A-dependent terms are absorbed into the definition of the physical pa- 
rameters if we choose the renormalization constants as 


a ike 
— = mye 
: foe (=) + 


a we a 
Ae ee pa) ee Bae 
ae An log (=) a 
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Ve a (9.66) 


where the ellipsis stand for terms of order h?. The resulting I", to first order 
in h, is given by 


il = = 
he i (Grere 4- 5(0 -A)? + W(y-O+m)y - ied, 
+ dows? f Rumer ~ie f brpAty + (9.67) 


where &* is given in (9.52), J7* is given in (9.60), and I’ denotes the fi- 
nite, A-independent term of I, of (9.61). We have not explicitly calculated 
I here, but it does appear in J’. There are also all the one-particle irre- 
ducible vertices with more external particles which we have not calculated. 
They are convergent as A — oo and do not affect the determination of the 
renormalization constants. 

The finite corrections 1’* and JI* vanish when the fields obey the free 
field equations of motion with the correct masses; i.e., when iy-p+m = 0 
for the electron and k? = 0 for the photon. These corrections do not affect 
the propagation of the free particles, in the perturbative expansion we are 
using. For a bound electron, the correction need not vanish and indeed gives 
a measurable effect, the Lamb shift, for example. 

Notice also that, to the order that we have calculated, 


T= 22 (9.68) 


We have only calculated the dominant A-dependent part for the vertex cor- 
rection, so the comparison can only be made with the A-dependent part of Z2. 
The relation (9.68) is a consequence of gauge invariance and is an example of 
a Ward-Takahashi (WT) identity. It gives gauge invariance for gauge trans- 
formations involving the physical (renormalized) charge e. Equation (9.68) 
and other related identities will be derived later. 


More on vacuum polarization 


We shall now consider the simplification of the finite correction (9.60) to 
photon propagation, the vacuum polarization effect. First consider the case 
of very large values of k?, i.e., k? >> m?. In this case, we can approximate 
II* as P 

II*(k) % —< log (=) (9.69) 
30 m? 
By solving the equations of motion for I’ with an external current J,, we see 
that the interaction is given by 
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nN 
Wald] = 5 f Jule) Dl) J) 
ee 3 a oe 
= -=— a rd 
Dre Kk [ on log (=)| (9.70) 


We see that the interaction is effectively stronger at high values of k?, or at 
short distances. Vacuum polarization is thus a screening effect, the interaction 
at large separations is weaker. We also see that the true expansion parameter 
for QED is a log(k?/m7) and not just a. The formula (9.70) actually shows a 
pole for D(k) at k? = m? exp(37/a). This is known as a Landau pole. Clearly, 
perturbation theory will cease to be valid before one gets to these energies, 
so one cannot conclude anything definite about the pole from the calculation 
we have done. The pole is at best indicative of the fact that pertubative 
QED must be considered as an effective theory valid only for energy scales 
far below this value. 

The result (9.70) can also be expressed as aD(k) = Qe ¢(k)/k? by defining 
an effective k-dependent charge 


a 


Qe ¢(k) => 1 — & log (k2/m?) (9.71) 


Consider now the low energy limit of vacuum polarization. This would 
be important for atomic systems, where the typical value of k is in the 
electron-volt range which is much smaller than the mass of the electron 
(m.  0.51MeV). Expanding the logarithm in JJ*, we find 


SS 1 . a , 
~ Pies pee (9.72) 


The interaction between two charges is modified by the second term. For 
example, the electrostatic interaction between an electron and an atomic 
nucleus of charge Ze is now given by 


D(k) 


V(r) =—Ze? |= + 2 


r 157m? a a) 


The correction term to the standard Coulomb term is known as the Uehling 
potential. It gives a shift to the atomic energy levels; it contributes -27M Hz 
to the Lamb shift in Hydrogen and is thus a measurable effect. 


Lamb shift 


The 2S; — 2P1 splitting of the energy levels of Hydrogen-like atoms is 


2 2 
known as the Lamb shift. For the case of Hydrogen, two recent experimental 
values, due to Lundeen and Pipkin (LP) and Andrews and Newton (AN), are 


E(251) — E(2P1) = 1057.893+0.020MHz (LP) 
P41 
= 1057.862+0.020MHz (AN) (9.74) 
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Theoretically, there are many contributions to this effect. The major contri- 
bution is from electron self-energy. While the finite correction * vanishes 
for a free particle, X*(bound), calculated with bound-state propagators is 
not zero, and leads to shifts in the energy levels of a bound system. Calcu- 
lating this self-energy correction to order a? and including other effects due 
to the anomalous magnetic moment of the electron, the finite size of the nu- 
cleus, nuclear recoil effects and the Uehling correction discussed above, the 
theoretical prediction becomes 


E(2S1) — E(2P1) = 1057.864 + 0.014MHz (9.75) 
2 2 


This excellent agreement between theory and experiment may be taken as a 
confirmation of the self-energy and vacuum polarization effects. 


Anomalous magnetic moment of the electron 


The finite part of the vertex correction, which we have not calculated, pre- 
dicts an anomalous magnetic moment for the electron. This can be obtained 
by isolating the term proportional to [y,,,7,|(p—p’)” in (9.61) and evaluating 
it in the limit of low external momentum. The result can be expressed as a 
term w[Y,.,]v FH” in the effective action. This is like a magnetic moment 
interaction and one can express this as a shift of the gyromagnetic ratio of 
the electron, g — 2(1 + a/27). In other words, the theory predicts that the 
gyromagnetic ratio of the electron should be different from the value 2 which 
is given by the one-particle Dirac equation; the electron has an anomalous 
magnetic moment. Including higher-order (upto O(a?)) contributions, the 
result is 


g-—2 


= (1 159 652 140(5.3)(4.1)(27.1)) x 107!” (theory) 
= (1 159 652 188.4(4.3) + 200) x 10°” (experiment) 
(9.76) 


The numbers in brackets indicate the uncertainties, due to various sources, in 
the last decimal places.) This is one of the most accurately tested predictions 
of quantum electrodynamics. 


9.6 Renormalization to higher orders 


The one-loop calculations we have done so far show that if we start with the 
action of (9.46), namely, 


CCE |e 23 pF + 2(9. A)? + Zatb (y--+m— 5m) yp 


—ieZ;py"bA,| (9.77) 
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and if we choose the renormalization constants as given in (9.66), then the 
effective action I’ is independent of the cut-off A when A becomes very large 
compared to the momenta involved. This is the renormalization procedure to 
one-loop order. We shall now discuss how this procedure can be extended to 
higher orders in h. 

It is convenient to rewrite the above action in the form 


Su(A,B, b) = SalA,¥, 0) + Sc(A,¥,¥) (9.78) 


where 


SulA,¥, miele 


Fw FH += 5 (3: Ay? +0(y-04+m)—iedy*pA ! 


A-1 
PY + (8: Ay 


SoA, def ate 


+(Zo — 1b (y- 8 +m) p — Zedmynp 


—ie(Z, — 1)py"PAy (9.79) 


Sc is the sum of the so-called counterterms. The counterterms have coeffi- 
cients which depend on A and are canceled out in the effective action by the 
loop corrections. 

As we have seen, for the calculation of the corrections at the L-th loop 
order, we need the renormalization constants (or counterterms) only to one 
order less in h, namely, to order h’-1. The calculated loop corrections at 
the L-th order can then be used to fix the renormalization constants to the 
-th order, and these in turn can be used for the next-order calculations. In 
other words, one can recursively carry through the renormalization procedure. 
The integrands are to be expanded in powers of the external momenta or 
combinations of these such as the mass-shell quantities like p? + m? and 
iy-p+m. The first few terms, up to and including the power corresponding to 
6, the superficial degree of divergence, can have A-dependence. The remainder 
will be finite as A becomes very large. 

There is another, related way of viewing this procedure. We can use just 
the action S,; of (9.79) with no counterterms, and then, instead of potential 
divergences being canceled by counterterms, we can equivalently give a pro- 
cedure to define a new integrand, the so-called renormalized integrand, which 
can be used for the loop calculations. The renormalized integrand is defined 
so that there will be no further divergences when the loop integrals are done. 
In other words, given the integrand Zg for a 1PI Feynman diagram G, which 
is constructed from S,., we replace Tg by a renormalized integrand Rg and 
then do the loop integrals. We then have to give a method of writing down 
Rg given Tg. This will involve subtracting certain terms from Z¢; in com- 
parison with the counterterm approach, the terms which are subtracted may 
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be viewed as the conribution of the counterterms. This approach to renor- 
malization is usually called the BPHZ method, after Bogoliubov, Parasiuk, 
Hepp and Zimmerman. 

The Feynman diagram G may have subdiagrams which are divergent, 
even if G is superficially convergent. We have seen an example of this at the 
end of section 9.4 for the scalar field theory. (In the counterterm approach, 
some of the subdivergences are canceled by the lower-order counterterms.) 
Consider a particular term in I with a specific monomial of fields. Let Re 
be the corresponding integrand at a given loop order obtained by including 
the subtractions due to all lower-order subdivergences. Since subdivergences 
are subtracted out, the only possibility of divergence in using Rg is from the 
whole diagram, namely, only if 6g > 0. Therefore, if dg < 0, we define the 
renormalized intergrand Rg as 


= Rg, 6g <0 (9.80) 


and if dg > 0, we make a Taylor expansion in powers of the external momenta 
(or mass-shell quantities) and define 


Re =Re -— TeRce = (1 — Te) Re, dg >0 (9.81) 


Tc is the Taylor expansion of Rg in powers of external momenta (or mass- 
shell quantities) up to and including the order dg. (In comparison with 
the counterterm approach, T¢Re gives the renormalization constants or the 
counterterms corresponding to the order of the calculation.) 

We must now give the construction of Rg from Ig. Suppose Zg has a 
divergent subdiagram +y. We can then write Zg = Ig). Z,. The contribution 
of the counterterms is —T,R,, for 7, so for G we get Zg/(—T,R,). Suppose 
71, Yq are disjoint subdiagrams with renormalizations. (The diagrams are 
disjoint if they have no common line or vertex.) In this case, we can isolate 
the potential divergences as 


Losin} (-Fn Ry,)(—T, Ry.) (9.82) 


If the subdiagrams are not disjoint, we cannot write the counterterm 
contribution as above. For example, for the diagram in figure 9.10 we will 
have Z,-counterterm contributions or lower-order subtractions of the type 
shown in figure 9.11. The X’s in the diagrams in figure 9.11 denote the one- 
loop vertex counterterms. The mathematical expression corresponding to this 
is of the form T,, Ry, + Ty,Ry,, where 71, 72 are as shown in the diagram 
9.12 below. This subtraction is clearly not of the form (9.82). 
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Fig 9.10. A two-loop torrection to vacuum polarization 


wd pa wnt San 


Fig 9.11. Z; counterterm contributions in calculating vacuum polarization 
to O(h”) 


We can continue the series (9.82) to all disjoint subdiagrams. Once all lower- 
order subdivergences are subtracted out in this way, we get Rg. Thus 


Re = Tet+ S> Testnirn,.¥ [][(-Tx Rx) (9.83) 
Y1sY2s-> t 


This is Bogoliubov’s recursion formula. The sum is over all disjoint families 
of subdiagrams, namely, those with 7; 17; = 0. From (9.83) and (9.81), we 
get Re. 


Fig 9.12. Subdiagrams for two-loop vacuum polarization 
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As an example of how this formula may be applied, consider the two-loop 
vertex correction in figure 9.13. 


Fig 9.13. A contribution to two-loop vertex correction 


The disjoint families are y ( and the null diagram 0 and the full diagram G). 
The recursion formula gives 


Rg =Ig+ Le /4(—TyRy) =e +Ie@;-(-T,L,) 
=(1-—T,)I¢ 
Rg = (1-Ta)(1—Ty)Te (9.84) 


We have used the fact that R, = TZ, since, at one-loop, there are no diver- 
gences from the previous order. 

As another example, consider again the two-loop correction to vacuum 
polarization. In this case the disjoint families are y, and 2 as shown, in 
addition to the null and full diagrams. We then have 


Re =Ig+ Le /yy Gr, tee) a Ze /-1a (-T,, Ry,) 
ma (1 an Ty, = a) Ie (9.85) 


where, once again, R,, =Z,, and R,, = Z,,. The renormalized integrand is 
thus 

Rg = (1-Te) (1- Ty, — Ty) Te (9.86) 
Notice that Rg # (1 — Tg)(1 — T,,)(1 — T,.)Zc. This is because of the 
overlapping nature of the two loop integrations. 

In these formulae, we have a recursion rule; we must first determine R,, 
and then use (9.83) to obtain Rg. A solution to (9.83) and (9.81) directly 
in terms of Zg has been given by Zimmerman. This is the forest formula. A 
forest U is defined as a family of subdiagrams with the following properties. 


1. +; € U are proper superficially divergent diagrams. 
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2.7, 72€U— nN C 72 Or yo CN Or 71142 = D; ie., they are either 
nonoverlapping or one is contained in the other. 
3. U may be empty. 


The solution to (9.81) and (9.83) is then given by 
Re= >) [[t=)ic Ga (9.87) 


U yeu 


This is Zimmerman’s forest formula. 
Consider the application of this formula to the two-loop vertex diagram. 
The forests are 0, {y}, {G}, {y,G}. From the forest formula 


Re = Tg —Tyie —Tete + 1, tete 
= (1—Tg¢)(1-—T,)Ie¢ : (9.88) 


which is in agreement with (9.84). Similarly, for the two-loop vacum polariza- 
tion diagram, the forests are 0, {G}, {yi}, {v2}, {71,G}, {v2,G}. The forest 
formula gives 


Re = (1-T,, -Ty, —Te + T,, Te + Ty,Te) Te 
= (1-Te)1—T), — T,) Le (9.89) 


This agrees with (9.86) as well. 
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We now turn to the renormalizability of QED. A field theory is said to be 
renormalizable if the renormalization procedure can be carried and a finite 
effective action I constructed such that it involves only a finite number of 
undetermined parameters (masses and coupling constants). These parameters 
are to be determined from experiments. If I involves an infinite number of 
undetermined parameters, we say the theory is nonrenormalizable. QED is a 
renormalizable theory. 

For the purpose of doing loop calculations, the action for QED was written 
in the form (9.78), viz., 


Sz(A, v, v) = Sai(A,¥, 4) + Sc(A,¥, ¥) (9.90) 


where 


Sa(A, vp | d*z 7 Fw FH += (0: A)? +0(y:0+m) vo —iedy"bA, 


ane 


So(A,b,WRf ate |= 


SMa = Ae Aye 
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+(Z2 — 1b (y- 8 +m) — Zodmipy 


~ie(Zi — 1)b-y"WA,, (9.91) 


Sc is the counterterm action. In renormalizing the theory, we choose the 
counterterms to cancel the divergences to get a finite effective action I’. The 
counterterms are thus determined by the calculation of the regularized inte- 
grals which isolate potential divergences as A — oo. As a result, we have the 
following property. 


If the classical theory has a symmetry which is preserved by the 
regulators, then the counterterm action (and hence the functional 
integral) will have the same symmetry. 


An example of this is gauge-invariance in QED. The classical action has the 
gauge-invariance given by 


Sct (A’, wy’, w’) =e Sel (A, wb, ) 


Ai, = Ap + 9,6 
a! = ea) 
py =e ty (9.92) 


If we use a gauge-invariant regulator, we can expect Sc to have this symmetry. 
This leads immediately to Z, = Zj. Thus, with a gauge-invariant regulator, 
one can restrict Sc to be as given in (9.91), but with Z, set equal to Zp». 
(The functional integral involves a term $(0-.A)*, which is clearly not gauge- 
invariant. However, this does not affect our argument because A, is coupled 
to a conserved current. This will become clearer when we discuss the Ward- 
Takahashi identities in more detail in Chapter 11.) 

The counterterms one has to choose have the same structure in terms of 
fields and derivatives as the terms in the action. This can be understood as 
follows. Suppose there is a term of the form © which is a function of the 
fields and derivatives and which can be generated with a divergent coefficient 
c(A) as a result of a loop claculation, up to some order. We must then have 
a term ZO in the action to cancel the divergent part of this. Putting the two 
together, we then have (Z + c)O in the effective action. The A-dependent 
terms in the combination Z +c cancel out, but there is no a priori theoretical 
reason to say that Z-+c is zero; it could be any finite number AR, whose value 
must be taken from experiments. It should be viewed as another coupling 
constant in the theory. We can restate the above result as saying that there 
is a term \oQ in the classical action, which we then write as (Ar + Z)O. 
This shows that the counterterms are of the same nature as the terms in 
the action. If a particular kind of term is forbidden by symmetry arguments 
and the regulator preserves that symmetry, then it will not be generated by 
loop calculations and so can be excluded consistently from the action and 
the counterterms. 
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In the case of QED, all terms in the action have dimensions less than 
or equal to zero. (We include the d*z in the power counting.) From the 
superficial degree of divergence 6, we see that all the terms which can be 
generated with divergent coefficients are also of dimensions less than or equal 
to zero. This result is true for other field theories as well. In general, we have 
the result that an action of the form S = 5°); a;O;(~) where dimO; < 0 leads 
to counterterms with dimension < 0. (At the level of the Lagrangian, local 
monomials of the field and its derivatives which have dimension < 4 will lead 
to monomials of dimension < 4.) Thus the most general combination of terms 
made up of the fields of interest which are of dimension < 0 can be expected to 
be renormalizable. This means that after renormalization, the effective action 
will have a finite number of undetermined parameters or coupling constants 
a;. Once the values of these parameters are taken from experiments, we can 
then make predictions using the functional integral. Thus, in the case of QED, 
we may expect a renormalizable action to be of the form 


S= 3 ai0;(A, p, v) (9.93) 


where ©; are monomials of fields and derivatives which are of dimension 
< 0. (Strictly speaking, the action must be somewhat more restricted, since 
a mass term for the photon can lead to bad ultraviolet behavior for the prop- 
agator. This is briefly discussed in the next chapter; for now, we ignore this 
complication, because we are imposing gauge-invariance anyway.) Without 
symmetries, this is the best we can say. For QED we have the symmetries 


1. Lorentz-invariance 
2. Gauge-invariance 
3. Parity and charge conjugation 


Since there exists a regulator respecting these symmetries, and if we use 
such a regulator, we can choose a more restricted action which has these 
symmetries. The most general action consistent with these symmetries and 
with terms of dimension < 0 is (9.90) and so it can be a renormalizable 
theory. 

So far we have used the superficial degree of divergence. The fact that our 
analysis holds in general is due to the following convergence theorem due to 
Weinberg. 


If 6 for a graph and all its subgraphs is < 0, the corresponding integral 
is absolutely convergent. 


In building up the theory, one has to use this in conjunction with Bogoli- 
ubov’s recursion formula. Consider, for example, the O(h) terms. By consid- 
ering graphs with 6 > 0, we see that all divergent terms are of dimension < 0. 
At O(h?), consider Rg. We have already subtracted out subdivergences, so 
the kind of divergences generated by Re, viz., T¢Rg are analyzed by consid- 
ering just the superficial degree of divergence of G, since the subgraphs are 
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convergent. Analyzing terms with 6(G) > 0 shows that we again have only 
dim < 0 terms as potential divergences. Thus recursively, one is led to the 
result that 


dim < 0 terms => potential divergent terms of dim < 0 


This result combined with the symmetry argument can prove the perturbative 
renormalizability of QED. 

If terms with dim > 0 are used, we lose renormalizability. For example, 
consider if d4x(yw)? which is of dimension 2. This can lead to the graph 
shown in figure 9.14. The contribution of this diagram is divergent and re- 
quires counterterms of the type f d*z(y)%. This means that we should also 
have a term of this type in the action with some arbitrary coupling constant. 
Such a term, in turn, leads to the graph shown in figure 9.15. The contribu- 
tion of this diagram is also divergent and needs a counterterm of the type 
f d*x(w)®. This leads to more divergent graphs in turn and we end up with 
an infinite set of counterterms and hence an infinite set of coupling constants 
in the theory. Thus, quite generally, the addition of a term of dimension > 0 
in the action leads to a non-renormalizable theory. 


Fig 9.14. A one-loop 6-fermion vertex generated by (wy)? 


Fig 9.15. A graph generated by (#7)? 
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Some interesting examples of renormalizable theories are QED, nonabelian 
gauge theories, scalar field theory with a y*-interaction, and scalar QED with 
the action 


S= pes (0 + ieA)d|? + m?¢*¢ + A(¢"6)? + FP +4(0-A)?| (9.94) 
Some prominent examples of non-renormalizable theories are the nonlinear 
sigma models, the V — A current-current theory of weak interactions, and 
Einstein’s theory of gravity. 

The renormalization procedure can be and needs to be carried out in any 
theory. The difference between renormalizable and non-renormalizable theo- 
ries is then a matter of how one interprets and uses the effective action. For a 
renormalizable theory, one has a finite number of coupling constants and other 
parameters and predictions are straightforward. In a non-renormalizable the- 
ory, I" involves an infinite set of coupling constants. The action to be used in 
such a theory will have monomials ©; of the fields and derivatives of arbitrary 
dimension. The effect of a term of high dimension on low-energy processes is 
suppressed by powers of the energies involved. For example, consider a term 
like A f d+z(ww)?. The coupling constant \ must have dimension —2. So let 
us write it as g/M? where g is dimensionless and M is some constant with the 
dimension of mass which is characteristic of the theory. This term can con- 
tribute to two-particle scattering. The total cross section for such scattering, 
in the lowest order, for example, will go like g*p?/M*, just on dimensional 
grounds. Here p denotes some typical scale for the momenta of external parti- 
cles. This becomes negligibly small when p < M. Thus, even though we need 
to know the value of g to make a prediction about the two-particle scattering 
in general, we can actually make a prediction for low energies (low momenta) 
without knowing g, if g is not abnormally large. The range of validity of this 
is determined by the value of M, which will be a characteristic mass scale 
of the theory. We can thus use a non-renormalizable theory sensibly, make 
predictions with it and so on, if we restrict the range of validity of our pre- 
dictions to energy regimes which are low compared to the characteristic mass 
scale. 

One can also understand and estimate this mass scale as follows. The cross 
section we have calculated grows with energy. But cross sections have to be 
bounded by unitarity of the S-matrix. The growing cross section will violate 
this bound at some energy. This gives a rough estimate of the energy beyond 
which we will need the value of g to make predictions. (An explicit calculation 
along these lines is given in Chapter 13.) Notice that if we introduce a new 
set of particles (and fields) at this stage, the Hilbert space is enlarged and the 
implementation of unitarity changes. Thus the breakdown of unitarity can 
also be interpreted, in some cases, as the signal of producing new particles 
as intermediate states in collisions. In turn, we may consider this as a signal 
of the need to augment the theory with new particles. 
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Ultimately, rather than having an infinite set of coupling constants, we 
might consider it desirable to introduce relations among them, based on some 
symmetry perhaps, and adding some new fields as well. This would leave 
us with a finite number of parameters again and would be equivalent to 
embedding the theory in a renormalizable theory which would then have a 
much larger range of validity. 

As an example of how this might work out, consider the nonlinear sigma 
model which is used for describing low energy pion-nucleon dynamics. 


1 x 
sp -5 02 [ ate (x [(U-*d,U)?] + Ny-ON 
+mNUP_N+NUt Px) 


as 
La 5 (1 +s) (9.95) 


where N is a two-component column vector, each component of it is a 
four-spinor; N, denotes the proton field, Na denotes the neutron field, and 
U is a (2 x 2)-matrix denoting the triplet of pions y*, a = 1,2,3, via 
U = exp(ir°y?/V2 fr). 7? are the Pauli matrices, connecting the proton 
and neutron, and thus generating the isospin transformations. f, is a con- 
stant, called the pion decay constant. This theory is non-renormalizable, with 
interaction terms containing arbitrarily high powers of the pion field. One has 
to add an infinity of terms to (9.95) to make a renormalizable theory; the 
simplest such term would be 


Si =c / d*x Tr |(U-*0,0)*| (9.96) 


Pion-pion scattering shows that the theory (9.95) can be used up to approx- 
imately 47 f in energy. Around that value of energy, the contribution from 
the term (9.96) becomes important and one must add this as well. Terms of 
higher dimension than S; also become important at higher energies. Eventu- 
ally, one needs so many new fields and so many terms that one has to seek 
a better theory to which (9.95) is a low-energy approximation. In the case of 
pions and nucleons, such a theory is quantum chromodynamics (QCD). 

A similar situation holds for gravity as well. We can, and do, use Ein- 
stein’s theory to make low-energy predictions. The energy scale involved is 


the Planck mass Mp = (Gu)72 =~ 10!9GeV. (Gyn is Newton’s constant.) All 
experiments to date are far below this energy and so we have not faced any 
serious discrepancy in the theory of gravity yet. Terms of dimension higher 
than the Einstein action, such as the square of the Riemann tensor, should 
be there in the theory of gravity. The hope is that there is some symmetry 
which will gather up all such higher-dimensional terms, an infinity of them, 
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into an action with a finite number of parameters. One can then formulate 
gravity as a renormalizable theory, with possible infinities being restricted by 
the symmetry. We do not know what this is yet, but string theory may be 
such a theory. 

A note on regulators — 

When we change regulators, the divergent terms change by finite amounts. 
Thus the counterterms change by finite amounts as well. The parameters, as 
they appear in the action, are also changed. But they are determined by 
experiments and so we can specify them in terms of scattering amplitudes 
at certain chosen momenta. If we express the cross sections in terms of the 
value of scattering amplitudes at the chosen momenta, then the parameters 
are eliminated (in favor of the amplitudes at the chosen momenta) and the 
results are independent of the regulator. 

Given the relationship between the counterterms and the regulator, we 
can say that a choice of regulator is equivalent to a choice of (the coefficients 
of ) the counterterms. This shows that there is nothing particularly important 
about symmetries in a regulator either. If a regulator exists which has the 
desired symmetry, we can still choose to use a regulator which does not re- 
spect this symmetry. In this case, we simply have to have counterterms which 
do not respect the symmetry. One can then impose desirable symmetries at 
the level of the renormalized I’. For example, if we calculate the vacuum 
polarization graph in QED without a gauge-invariant regulator, we will find 
a term like A?.A?. This can be canceled by a counterterm pA’ by choosing 
p? appropriately. This counterterm does not have gauge invariance; this, by 
itself, is not a problem. Gauge invariance is important for eliminating the 
unphysical polarizations of the photon and thereby getting a unitary theory. 
We must thus impose it on the renormalized theory, which will require the 
counterterm A? to cancel the A? term generated from vacuum polarization 
exactly. This is in fact the procedure we used in retaining only the transverse 
part of 7” in the vacuum polarization. Thus, the use of a regulator which 
does not have gauge invariance is perfectly acceptable so long as we can im- 
pose gauge invariance on the renormalized action by suitably choosing the 
coefficients of the counterterms. Nevertheless, in practical calculations, espe- 
cially with many external particles and loops, it is algebraically much simpler 
if we choose a gauge-invariant regulator. (We might also note that the exper- 
imental upper bound on the photon mass is very small, < 2 x 107/%eV.) 

There are situations when there exists no regulator which has all the 
symmetries of the classical action. Typically, this happens when we have 
many symmetries and we cannot have a regulator which preserves all of 
them. In this case, it becomes impossible to impose all the symmetries at 
the level of the renormalized theory. We impose the most important ones, 
such as those required by unitarity, and use a I which breaks some of the 
other symmetries. The symmetries which are broken by the regularization 
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process are said to be anomalous. This is discussed in more detail in Chapter 
13s 


9.8 RG equation for the scalar field 


The fact that the Green’s functions G(x1, 22, ..., xn) of the renormalized the- 
ory do not depend on the cut-off A leads to constraints on their asymptotic 
behavior. This can be expressed by an equation which tells us how the Green’s 
functions behave under a scale change of the momenta or the coordinates. 
This is the renormalization group (RG) equation. The RG equation can be 
used to determine the asymptotic behavior of Green’s functions. It has also 
been used to determine the critical exponents of various field-theory models 
in statistical mechanics. In this section, we will derive the RG equations for 
a scalar field theory. 
The Green’s functions for a massless scalar field theory are given by 


G(a1, 22, ...,2N) =N fide e-> (a1) y(x2)--- y(ay) (9.97) 


where 4 
S= ‘| d‘x | 520(4)(0%) + AZ (A)os| (9.98) 
The Green’s functions so-defined are finite as A — oo. In terms of x defined 
i 

by y = Z, 7x, we have 

S= jas Foss +o | 

do = AZ (A)Z;z 7(A) (9.99) 
Ao depends on \ and A. We can invert this relation and write \ as a function 
of Xo and A; i.e., A = A(Ao, A). Since A does not depend on the cut-off A, it 


must be that the A-dependence of Apo cancels out the explicit A-dependence 
of the function 4. In other words 


Or OXo Or ) 
—} = —— = 9.100 
(oe) Bivcs y (se3 ” 210) 
In terms of the variables x, the Green’s functions can be expressed as 
Glo, 22,42) = ZN Pde) e-$ x(ar)x(02) ---x(ew) 


= 25-2 ( x(a1)x(a2)-+-x(en) ) (9.101) 


where we have used the abbreviation 
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(0) =" ‘i iene (9.102) 


We now turn to the RG equations. The A-independence of the Green’s 
functions gives 


d 
dlog A 
Using the expression for G as given in (9.101) and differentiating through, 
we find 
N dlog 23 
2 dlog A 


G( 21,29; -:+) DN eee ea (9.103) 


G a x(21)x(Z2) +--+ x(zNn) (sa) )=0 (9.104) 


In order to simplify this further, we need the cut-off dependence of the action 
S. This comes from two sources, one due to the cut-off dependence of Apo in 
the interaction term and the other in the integrals of the monomials of the 
fields. The latter point becomes clear if we write the expressions in momentum 
space. For example, 


4 
so(Ayet fate (axy?=tf 22 xp x) — @.108) 
2 oF 


2< A2 (27 ) 
The largest value of momentum is given by A, or equivalently, there is a 
short-distance cut-off for spatial separations; we need |x — y| > 1/A. If we 
replace A by (1+ .0)A, we must have |z — y| > 1/(1+0)A. The integral 


can then be related to the integral with cut-off A by the change of variables 
x—>z=z2 (1+0), |z-2z'| >1/A. Thus 


i I! 
er oi 5 f ae (Ox)" Ree ~ 2 (ito)? fa: (OX) | iste 
= ; i! d*z (0x)? =i‘ C9106) 
where 
X(z) = x(z/1+0¢) 
7 5 +1) x(z) 
= x — o(Dy) (9.107) 


The effect of the change of cut-off is thus to change the fields y by o(Dx) 
where 


) 
Dx = («. az 4 1) x 2 (9.108) 


gives the effect of a dilatation or scale change on the fields. The result (9.106) 
can thus be written as 
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dSo ws A 6 
imei / d's (Dx) & (9.109) 


For the interaction term, we have a similar expression and, in addition, we 
have the change due to the explicit A-dependence of Apo. 
dSint 


a fa Poe dro f at 4 


dlog A 
Using these results in (9.104) we find 


N dl 
FT Ot 298 xleryxte)---xtew) (f vgs) ) 
= Fg aN xe )xlata)-~-xCeew) fa x4) =0 


(9.111) 


A change of variables x — x’ = y + 6x in the functional integral leads to 
the identity 


/ fax] eS O(x) = f lax] eS 01) 


= ‘i [dx] e~S™ O(x) + / [dx] e~$©) |50 - 0 58] 
(9.112) 


In other words 


(60) — (O 6S) =0 (9.113) 
Equation(9.111) can now be simplified as 


re) N dlog Z3 
bye Ox; oo cy a ee | idee. EN) 
OXo 


~ log A 


Zag NIAC (as x(a) -x(y2) / hone en 
(9.114) 


The term with the insertion of the interaction term [ x* can be obtained from 
the coupling constant dependence of the Green’s function. Differentiating the 
expression (9.101) for the Green’s function 


re) N Olog Z3 
—— (a G 


= Z5-N/(x(0)x(0a)---x(ew) fx) (9.118) 


Here we consider Z3 as a function of Ap and A. Further, writing 
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dlog Z3 Olog Z3 OXo € log Z3 ) 
= oe ——— 9.116 
Eder ( Dro ) DlogA* \ Glog A,’ uetie) 
equation — becomes 
O45 COD 
= > . OAT7. 
where ee 
0g 43 
=—— 118 
bars (my | a 


Finally, we want to express this in terms of derivatives with respect to the 
renormalized coupling constant \ rather than Xo. Since po is a function of A 
as given by (9.99), 


Do 0 g__a (AA) aC 
OlogAO\0 ~=—SsC« Os log A \ Oo A OA 


__ (a) a 
a OlogA/,, OA 
OG 
= BA) ay (9.119) 


where we have also used (9.100). Combining this with (9.117), we get the 
renormalization group (RG) equation 


oa +N(1+7) + Bay a Gey 2s; ay 0 (9.120) 


The quantities G and ¥ in this equation are, once again, 


On 
Pe er 


_ 1 (Alog Zs 
a ( dlog A i eae 


These are functions of the coupling constant A. 

For the y* theory it is easy to calculate @ and 7 to the lowest order 
in perturbation theory using our explicit formulae for the renormalization 
constants. The results are 


BA) = mt 


1A) = = Tee 


In the noninteracting theory, 6 and ¥ are zero and the RG-equation shows 
that the Green’s functions behave as functions of dimension N, or equiva- 
lently, the fields y have dimension 1. This is also clear from (9.108) and 


(9.122) 
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(9.109), which show the effect of a dilatation on the fields. In the interacting 
theory, the Green’s functions no longer have any scaling behavior in general. 
However, in the special case when ( = 0, there is still scaling behavior; the 
dimension of N-point Green’s function is then N(1 +), not just N. For 
this reason, y is referred to as the anomalous dimension of the field y. ( is 
a measure of the scaling violations due to the interactions; it is referred to 
as the G-function. Even though there is, in general, no scaling behavior for 
the Green’s functions in the interacting theory, scaling behavior may occur 
at values of the coupling for which @ = 0. In other words, the theory can be 
at a zero of the @-function. Such zeroes can occur at very high or at very low 
momenta, depending on the theory. In order to show how this can happen 
and how scaling behavior is obtained, we shall need to solve the RG-equation. 


9.9 Solution to the RG equation and critical behavior 


The solution is most easily obtained in terms of an effective coupling constant 
A, referred to as a running coupling constant. It is defined by 


dX 2 

— =-B(A 9.123 

=~) (9.123) 
Here s is a scaling parameter, which scales the coordinates as x — e*x. We 


may write (9.123) as : 
r 
du 
s=-| =~ (9.124) 
[ B(u) 
Notice that \ is obtained as a function of \ and s and we have chosen the 


initial condition 1 = » at s = 0. Differentiating both sides of (9.124) with 
respect to A, we also find 


po) =a) (9.125) 


We may now write the solution of the RG equation as 


5 * Au) 
G(e°x}, e°22,...,€°2N, A) = G(x, X2,...,0N,) exp ‘ Blu) (9.126) 


where A = N(1+ 7). One can check this solution by differentiating with 
respect to s. This will give the equation 


Lg —G= = ~BO) 55 AG (9.127) 


By directly differentiating with respect to A and using (9.125), we find 


B(A) —— = geg2e ~ G LAnjG-Awe (9.128) 
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The last two equations (9.127) and (9.128) verify that G given in (9.126) is 
indeed the solution of the RG equation. 

The nature of the solution can be understood by first considering the 
zeroes of the G-function. Perturbatively, 3 is zero at 4 = 0. In general, one 
may have other zeroes for the 3-function. Let A, denote a zero of 2, 3(A.) = 0. 
The RG equation (9.120) can be trivially solved at this value of the coupling 
constant, by setting G = 0, as 


G(e°x,, e°a2, ...,€°aN, Ax) exp(—sA(A..))iC(Gapeoyegaeneds) lOrl29) 


This shows that the Green’s function has a simple scaling behavior with 
dimension A(A,) = N(1+7(A.)). N is the expected canonical dimension of 
G since y has dimension 1. y evaluated at A, is the anomalous dimension 
at the chosen value of coupling. Notice that once we are at a zero of the /- 
function, the running coupling is a constant with respect to scaling and does 
not run anymore. Zeroes of the (-function are therefore called fixed points. 
In the neighborhood of a fixed point »,, @ is given by 6 = B’(A.)(A — Ax). 
From equation (9.123), will approach the fixed point \, as s increases if 
B'(Ax) > 0. The Green’s function is, in general, a function of differences 
of coordinates x; — x;, so that an increase in s corresponds to considering 
processes at larger separations of points, in other words, the infrared limit. 
For this reason, we say that such a fixed point is an infrared stable fixed point. 
The Green’s functions or correlators of the theory show scaling behavior in the 
infrared limit; the theory is effectively at the value of the coupling constant 
Ax, which is the infrared stable fixed point. Because of the scaling behavior, 
we say the theory is critical. On the other hand, if 6’(\.) < 0, A approaches 
the fixed point as s decreases or as we go to shorter and shorter separations. 
This gives an ultraviolet stable fixed point and the theory can be described 
by the critical theory at large momenta. The approach to such fixed points is 
also important; if the approach to scaling is slow, say, logarithmic, we never 
attain criticality; we have only asymptotic criticality. 

The general solution (9.126) shows that we do not have scaling of Green’s 
functions in general. Nevertheless, there is a simple rule to obtain the behavior 
of the Green’s function under scaling. The effect of scaling the coordinates 
x is obtained by replacing the coupling constants in the Green’s function by 
the running coupling constant \ and then there is an extra exponential factor 
due to the dimension of the function. 

In perturbation theory, we always have a zero of the (-function when 
the coupling constants are zero, since 3 is obtained as a power series in the 
coupling constants. Near zero coupling, if @ is positive, \ = 0 is an infrared 
stable fixed point; the theory tends to remain perturbative for processes at 
low momenta. This is indeed the case for the y+-theory we have considered. 
(The same property holds for QED as well.) In the ultraviolet region, we 
get a strongly coupled theory as the effective, or running, coupling grows. 
If @ is negative, the free theory is obtained in the ultraviolet limit, for for 


9.9 Solution to the RG equation and critical behavior 175 


processes at large momenta. This is in fact the case for nonabelian theories, 
if there are not too many matter fields. The high-energy limit of the theory 
is perturbative; the approach to the free theory is only logarithmic and so 
this case is referred to as asymptotic freedom. We will discuss asymptotic 
freedom in QCD in some more detail in the next chapter. 

The renormalization’ group has been extensively applied to the study of 
critical behavior of field theories. At a second-order phase transition, there 
is scaling behavior for correlators and it is possible to calculate the critical 
exponents for the correlators using RG techniques. We will give only a very 
simple example of this here. Consider the y*-theory in three space dimensions. 
Such a theory can be argued to be a good description for the critical point of 
the three-dimensional Ising model. The simplest way to apply our analysis is 
to regard the theory as being defined in 4 — € dimensions. For the particular 
case of interest, « = 1, but the idea is to regard € as an expansion parameter, 
setting it equal to 1 at the end. Then we can apply much of our analysis. The 
dimension of the field is now 1 — 5¢. The interaction term f d*~*z y* has a 
nonzero canonical dimension —e. By redoing the calculation in (9.110), we 
can see that the @ function has an additional term —e,, in addition to what 
is calculated in the theory at 4 — € dimensions. 


B(A) = B(A)|a~e — €A | (9.130) 
The perturbative calculation of G@ to the lowest order in X and € is 


2 
B)|t-e © BW & 55 (9.131) 


We now see that there is a fixed point or a zero of the 6-function at 


2 
ee: (9.132) 
9 
For the anomalous dimension we find 

3X" 

¥(A)|4—e & ¥(A)|4 = Té6n4 
s (9.133) 

Ax) © 358 ; 


The solution of the RG equation then shows that there is scaling behavior 
for the correlators and y has effective dimension 1 — 4€ + (Ax) © 5 + qo3- 
Since the Green’s function depends on differences of coordinates, we obtain 
the behavior ' 


ea (9.134) 


G(21,22) ~ 


where the critical exponent 7 = €7/54 = 1/54 to the order we have calcu- 
lated. Notice that the fixed-point value of the coupling and the exponent are 
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close enough to zero that perturbation theory is adequate for the calculation 
presented here. 

As a method of analyzing the three-dimensional theory, the e-expansion 
can be carried to higher orders; it is an asymptotic expansion, and so, the 
actual values of critical exponents will not be improved by calculating to 
arbitrarily high orders, they are best approximated. by. a calculation to a 
certain optimal order which depends on the theory. 

In principle, the correlators can be calculated by resummation of pertur- 
bation theory or by solving the Schwinger-Dyson equations of the theory. 
The RG method can thus be looked upon as an efficient way of solving the 
Schwinger-Dyson equations in the kinematic regime (either ultraviolet or in- 
frared) of interest. There is another related and very useful point of view: If 
we are interested in the infrared regime, we may think of the RG technique 
as a way of incorporating the effect of modes of the field of high momenta to 
obtain an effective action for the low-energy theory. 
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10 Gauge Theories 


10.1 The gauge principle 


We have already discussed the principle of minimal coupling in electrody- 
namics and how it can be viewed as the requirement of gauge-invariance. 
The coupling of a fermion field ~ to the electromagnetic field A,, is described 
by the Lagrangian 

L= iy" (0, —teA,) — m) p (10.1) 


It has the property of gauge-invariance, viz., it is invariant under the trans- 
formations 


w = ap! = ery), wp ae ap a pet? 
Ay Al, = Ay +8,0 (10.2) 


where @ is an arbitrary function on spacetime M. We can write the above 
set of transformations as 


py’ = gv, yp =og"* 
eA), = eA’ = g(eA,)g"* — i0,997* (10.3) 


where g(x) = exp(ze@(zx)) is a function on spacetime M taking values in the 
group U(1). (We have absorbed e into the function @.) In this chapter, we 
want to generalize the above principle of minimal coupling to groups other 
than U(1). 

Consider a set of fields ~;, 1 = 1,2,...,N, which can transform into each 
other under the action of the fundamental representation of SU(N). (In other 
words, 7; are elements of an N-dimensional vector space, which can carry the 
fundamental representation of SU(N).) Introduce a gauge transformation on 


yi by 
Wi = 9i3(2) 0; (10.4) 


where g;; is an element of SU(N) in the fundamental representation; i-e., it 
is an (N x N) unitary matrix with determinant equal to one. g;; depends 
on the spacetime points, and hence we have an SU(N)-valued function on 
spacetime. For any other representation R and a set of fields ¢, which carry 
this representation we can write 
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bry = Das(9) $6 ° (10.5) 


where Daa(g) is the representative of the element g in the representation R. 
One can likewise write down transformation laws for any group G and any 
chosen representation. 

Generalizing what we did earlier, we now define a “covariant” derivative 
on such w’s. Consider 0,~’ given by 


OW’ = (Ong) + 9(Ouv) (10.6) 


0, does not transform covariantly as w does, but there is an extra term 
0,,g. We can define a derivative which transforms covariantly, denoted D,v, 
given by 

Dub = (0, + Ap) o (10.7) 


We introduce a matrix potential A, and choose its transformation law so 
as to cancel the inhomogeneous term in the transformation of 0,7). In other 
words, we require the covariance condition 


D,(A%)(9b) = 9 (Du(A)¥) (10.8) 
From this it follows that the transformation law of A, is given by 
Ag = gA,g™ — (8,9)97 (20.9) 


The quantity (0,9)g~+ is Lie algebra valued; i-e., it has the form —it70,,0°e?(6) 
for g of the form g = exp(—zt*6°(x)). t* are matrix representatives of the 
generators of the Lie algebra of G and satisfy the commutation rules 


[iad ee aa (10.10) 


t* are hermitian matrices and f° are real constants; a,b,c = 1,2,...,dimG, 
where dimG is the dimension of the group. Since the 6°’s are dimG inde- 
pendent functions, we will need dimG A,,’s in general; further, A, is Lie 
algebra-valued, since it has to be added to Ogg~! as in (10.9). Thus we can 
write 
Ay = —it° Al, (10.11) 
a 


where A/, are real functions. We have absorbed the coupling constant e into 
Aj; it can be restored at any stage by replacing Af; by eA%. The gauge 
transformation law can be written, for infinitesimal transformation g ~ 1 — 
tegen <1, as 

AS = A, + it* (0,0% + fA? 6°) (10.12) 


The combination (0,07 + f abc A? 0°) is the covariant derivative for the adjoint 
representation. The Lie algebra (10.10) is satisfied by the matrices (T*),. = 
—if%° due to the Jacobi identity; this is the adjoint representation of the 
group. In this case, (10.7) can be simplified to 
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(Dypiites 0, ,om—aeah (eT *Y99” — (Os0% faa ) (10.13) 

We may thus write the infinitesimal transformation law (10.12) as 
Ay, — A, Si (D,e)" (10.14) 


The covariant derivative gives us a prescription for coupling A,’s to any 
matter field. What we do is to replace all ordinary derivatives by the covariant 
derivatives. Of course, one still needs a Lagrangian for the A,,’s themselves; 
this is provided by the field strength. The field strength tensor F',,, is defined 
by 

Fyy = DyDy — Dy Dy = [Dy Dy] 
= —it* (0,49 — O, Aa + fo AP Ac) 
= it" FY, (10.15) 


From the covariance of D,, we find 


FA y( (gp) = [TA ), D, (A?) (gv) iG [Diy] y= 9F wv (10.16) 


In other words, 


Piel V=saek ig * (10.17) 
Thus F),, transforms covariantly. Using Fy, = —it°Fi, and Go 
D*>(g)t*, we get 

POA) — 2 aye, (10.18) 


where D?°(g) is the adjoint representation of g. We choose the normalization 
of the t*-matrices to be given by Tr(t*t?) = $6%°. The adjoint representation 
of g can then be written as D®°(g) = aTr(t2gt’g pele 

One choice of a Lagrangian for A,, is 


1 il 
L= ar: ie = 5e2 It (Eoak oe) (10.19) 
From the covariant transformation law of Fy,, viz., Fyu,(A9) = 9Fuwwg7', 
it is clear that CL is gauge-invariant. The Lagrangian (10.19) is called the 
Yang-Mills Lagrangian. It is the generalization of the Maxwell Lagrangian 
for the electromagnetic field. It is the simplest generalization for a general 
Lie group, but there are other Lagrangians possible, at least in special cases. 
For example, in three spacetime dimensions another possibility is the so-called 
Chern-Simons term 


ee 2 yva 
b= (4,0.4c 2 5 Avy Aa) € (10.20) 


where ¢€#”% is the Levi-Civita symbol in three dimensions. (This Lagrangian 
is actually not invariant under all gauge transformations. It changes by a 
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total derivative under infinitesimal transformations; the action is invariant. 
There are also the so-called homotopically nontrivial transformations under 
which the action is not invariant, but e*° (which is what is important in the 
quantum theory) is invariant for integral values of «. Thus for these cases, 
one has a well-defined quantum theory.) At this stage we shall concentrate 
on the Yang-Mills type theories. ee 

The equations of motion for a Vane field A, avin coupling to matter 
fields is easily obtained from (10.19). 6F%, = (Dy = — D,6A,)* which gives 

2 (1/e*) Fa, (D,oAy). The eae of motion are thus 


255m 


PR) ok Aa 


=5 ; (10.21) 


where S,, is the matter part of the action, i.e., terms in the action other 
than the Yang-Mills term. Analogous to the identity 0. Fy, + cyclic = 0 for 
electrodynamics, we have the Bianchi identity 


DaFiy + DF ea — 0 (10.22) 


The Jacobi identity for commutators, viz., [Du,[Dv, Dal] + [Dv,|Da, Dul] + 
[Da, [Du, Dy]] = 0, gives the above result directly. 

The importance of the gauge principle is that it offers a uniform way to 
couple A,’s to matter of different charges. We have already seen this in the 
case of electrodynamics. Thus in QED, for fields corresponding to particles 
of charge n, we have 


Dut = (0, — inaAy)p 
y! = ei (10.23) 


For a nonabelian Lie group G, the analog of different charges would be dif- 
ferent representations. Thus for a general representation R we have 


Duh = (O, — itRAl)y 
y’ = Dr(g) v (10.24) 


The A’ are always the same, dimG in number. t% are the matrix representa- 
tives of the generators of the Lie algebra of G in the R-representation. There 
is only one coupling constant e for each simple group. For example, if we con- 
sider the group SU(2) with matter fields in the representations of dimensions 
2 and 3 (namely, for j-values 3 and 1), we have 


a 


a o Q - a 
Yor @=12 (Mae=(F) .  DyWa=Buta-i( Fase) 


b; b= 1, 2,3, (t*)be = the (Dud)* ras um te eu AG 
(10.25) 
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10.2 Parallel transport 


The matter fields ~, ¢, etc., can be considered as the components of a dimR- 
vector with respect to some chosen basis in a dimR-dimensional linear vector 
space V. In other words, we can write 


p=wie EV; (10.26) 


where e; are a set of basis vectors for V at the point x“. The gauge trans- 
formation ~/ = g;;7); is equivalent to choosing a different basis for V. Since 
g depends on x“, the gauge invariance of the theory reflects the fact that 
physics is independent of our choice of basis (frame) at each point labeled by 
xz". We can choose frames independently at each spacetime point, which is, 
arguably, a sensible requirement even on a priori grounds. After all, it would 
be surprising if the ad hoc or conventional choice of a frame in this field space 
were to affect physical results. The fact that physics is independent of the 
local choice of frames is analogous to the case of the general theory of rela- 
tivity, except that in the latter case V, is not an arbitrary vector space, but 
the tangent space at x<” to the spacetime manifold. 

The field strength F,, is the gauge theory analog of the Riemann cur- 
vature Rep, and can also be understood as the angular deficit for parallel 
transport around small loops on the spacetime M, with the qualification 
that the angular rotation is a frame rotation in the internal field space of the 
w’s. 

One can analyze parallel transport by considering covariantly constant 
w’s. Consider 

Du = (0, + Ay) =0 (10.27) 
In this equation, 7 is a column vector on which the matrices t® in A, can 
act by matrix multiplication. Introduce a U(z) defined by 


(3, + A,)U(x) =0 (10.28) 


Such a U does not exist in general as a well-defined function on M, but 
one can integrate the above equation along curves C' from, say, y” to x’ to 
get a path-dependent U(z, y, C). In other words, we are solving the equation 
C#(0, + A,)U(x) = 0, where C” is tangent to the curve C. 


U(zx,y,C) = [1 — Ay, (2 — ©)e"*][1 — Ay, (x — 2e)e4?] --- 
“++ [1 = Apy (yer 


z 
= P exp (- A,dc" 
yC 


= P exp ( / ; t*Agdo"*) (10.29) 


where the symbol P denotes the ordering of the matrices A, along the path 
as indicated by the first of the equalities above. We divide the path into N 
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intervals of small displacements e“ and write an ordered product of factors 
like {1 — A,,(a — €)e#] and eventually take « — 0, N — oo, keeping the total 
displacement finite and the order of multiplications unchanged. It is then easy 
to check, from the first of the above equations, that U(x,y,C) does indeed 
solve (10.28). Given U, D7 = 0 is solved by 


y=U eee (10.30) 


with wo = contant. Thus U(x, y, C) tells us how frames rotate as we transport 
along C from y to 2. 

The set of equations D,,U = 0 (corresponding to different values of y) 
have, as the integrability condition, F,,, = 0. Thus U(z,y,C) would be in- 
dependent of the curve C and give a well-defined function on M if and only 
if Fy, = 0. (If there are noncontractible loops on M, it is possible to get 
more general solutions. U(x, y) defined by integration along an open path 
C need not be the same as what is obtained by integration along C with 
an added circuit around a noncontractible loop, since the two paths cannot 
be deformed into each other. For simply connected spacetimes, F,,, = 0 will 
give a path-independent function U(x) as the result of the integration.) If 
Fv = 0, we can write, for simply connected spaces, 


A= —(0,07U— ; (10.31) 
If Fv # 0, we see that, for a small closed loop, we have 
Cc 


where o”” is the area element of the infinitesimal surface whose boundary is 
the curve C. (The above result is essentially Stokes’ theorem. This theorem 
does not work, as it is, for larger loops, because of the matrix nature of the 
A,’s which means that A’s at different points do not necessarily commute 
with each other. For small loops, the above result can, however, be seen by 
direct expansion of the path-ordered exponential.) The above result (10.32), 
along with (10.30), shows that F,, measures the “angular deficit” for parallel 
transport around small loops. 

Using the equation D,U = 0, one can check that, under a gauge trans- 
formation of the potentials A,,, we have 


U(z,y, C, Al) = g(a) U(z,y,C,4,) 9 GY) (10.33) 


For a closed loop, where z and y coincide, we have a similarity transformation 
of U by g(x) and hence the trace is invariant. The holonomy operator or the 
Wilson loop operator W(C) is defined by 


W(C) =T [P ae (- f Aydo" 
— Ty E ae ( £ re Azdo" (10.34) 
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10.3 Charges and gauge transformations 


The Lagrangian for a gauge theory is invariant under the gauge transforma- 
tions 


A, > AZ = a — (d,9)97", wy wo = gw (10.35) 


Consider first the constant transformations for which g(x) is independent of 
x". For these cases, the gauge symmetry is just like any global symmetry. 


Ay + AS =gA,g", yp yt = gp (10.36) 


Thus we must expect a Noether current and associated charges. We can find 
the current by considering the variation of the Lagrangian as discussed in 
Chapter 3. As an example, we shall take the matter field to be fermions 
coupled to A,. We then find 


1 ie 
6£L= eae ad) + ipy"O,0W +++ 


i = 
= On |- = ( Fe #*§ AS) + iwy*dw| + terms proportional to 
e€ 


equations of motion 
(10.37) 


Thus the Noether current J* is given by 
Jonge = 5 (F648) + iby" by) (10.38) 


where 6° are the (infinitesimal) parameters of the transformation. From 
(10.36), 6A¢ = —f°A>@*, dp = —it°O°y. The current is thus identified 
as 

JOH = 5 fOP AS + Gyhiry (10.39) 


This current is easily checked to be conserved by the equations of motion. 
The equations of motion for the gauge field are 


il = 
35 (Du FH)? = byw (10.40) 
Using this equation and the definition of the covariant derivative, we find 
apy 
J°# = 0, | (10.41) 
e 
The corresponding charges can thus be written as 
Or = A gt ae ce Feds; (10.42) 
€* J \x|—00 
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The charges are given by two-surface integrals. This is a general feature of all 
gauge theories. (For example, in the general theory of relativity, for asymp- 
totically flat spacetimes, the charges corresponding to coordinate transfor- 
mations, viz., momentum and angular momentum, are defined by surface 
integrals at spatial infinity. Generically, this has to do with the fact that 
charge densities cannot be defined in a gauge-invariant manner; compare 
with equation (10.39), for example. Also notice that equation (10.39) shows 
that there can be nonzero charge even when the matter fields are zero; the 
gauge fields themselves are charged in the nonabelian theory. ) 

Since the constant transformations (10.36) act like a Noether symmetry, 
we expect that the wave functions or states transform as representations of 
the corresponding symmetry group, i.e., : 


\W9) = ef Wy). (10.43) 


This result can be shown as follows. The general variation of the Lagrangian 
is given by (10.37). From the surface term in the action, we can identify the 
canonical one-form as 


e= ‘) Bx |-aFe as + ityPsu| 
ii i 
= je | aPaeAs + itnP6s| (10.44) 


This canonical one-form shows one of the difficulties of quantizing the gauge 
theory; there is no canonical momentum for Aj. However, as we have seen 
in the case of electrodynamics, we can set Aj = 0 by a choice of gauge. 
For the remaining components, the canonical one-form (10.44) leads to the 
commutation rules 


[A?(a, t), A2(y, t)] =0 
[Af (a, t), FG; (y, t)] = 157°5;;6° (x — y) (10.45) 
[Foi (a, t), Fo, (y,t)] = 0 


The fermion fields have the usual anticommutation rules. 

As in the case of electrodynamics, the Gauss law requires special treat- 
ment. It is part of the Lagrangian equations of motion. Since it does not 
involve any time-derivatives, in a Hamiltonian formalism, it cannot be ob- 
tained as an equation of motion, but must be imposed as a condition selecting 
allowed initial data in the classical analysis. In our quantum theoretic treat- 
ment of electrodynamics, we solved the Gauss law, eliminating the longitu- 
dinal component of A; which led to a set of unconstrained fields and then 
used the commutation rules for these unconstrained fields. One can carry 
through an analogous reduction here, but an alternate approach is to impose 
the Gauss law as a condition on the physical states. Recall that if we elim- 
inate the longitudinal part of Aj, it is given in terms of the charge density 
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which involves w and 1; as a result, those commutation rules in which the 
longitudinal part of A; appears will involve the fermion fields. The alternate 
approach of imposing the Gauss law on states has the advantage that we can 
have the commutation rules (10.45) and so the operator structure is simpler. 
However, there are states in the Hilbert space which are generated by the 
action of longitudinal components of A? on the vaccum. These are clearly 
unphysical states since they are absent if we solve the Gauss law to eliminate 
the longitudinal components. Physical states of the theory in this approach 
are then defined as those states which obey the condition 


G? |W) =0 (10.46) 


In other words, we do not require G* = 0 on all states (or as an operator 
equality). It is a condition selecting physical states in the Hilbert space. This 
condition, (10.46), will still ensure that the Gauss law is obtained for matrix 
elements with physical states, in particular, for expectation values of gauge- 
invariant operators for physical states. This is sufficient for observable results 
of the theory to be consistent with the Gauss law. 

For the nonabelian theory, the Gauss law is identified from the time- 
component of the equations of motion as 


G(é) = / Bx 9% |r) 4h vieey| (10.47) 


Consider now the transformation 


5At = —(D,6)? 
dy = ~it*0%y, oy! = plitr62 (10.48) 


which is an infinitesimal gauge transformation, but we do not necessarily re- 
quire that the parameters vanish at spatial infinity; instead, they could be 
nonzero but constant (independent of angular directions) at spatial infinity. 
The canonical commutation rules show that the generator of this transfor- 
mation is 


G(0) = ‘A ar larm(Dy + rule (10.49) 


(This gives i6A¢ = [A?, G(6)], etc.) Notice that the operator (10.49) differs 
from the Gauss law (10.47) by a surface term. The variation of a state when 
the transformation (10.48) is carried out is given by 


|W) = iG(0) |v) (10.50) 


Consider now states which obey the Gauss law condition (10.46), i.e., physical 
states. On these, for transformations for which the 0° go to constants not 
equal to zero at spatial infinity, we can carry out a partial integration to 
obtain 
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jw) = f | 9° Figs. 1) 


= iQ°6° |W) . (10.51) 
This shows that the constant transformations do act as a Noether symmetry 
on the physical states with the charge operator as giver hy (10.42). For trans- 
formations for which 9% go to zero at spatial infinity, there is no surface term 
and G(@) and G(@) coincide. The requirement (10.46) shows that physical 
states are invariant under such transformations. 
We can now summarize the basic result of the analysis given above. 


1. For transformations g(x) which go to the identity 1 at spatial infinity, 
i.e., for 9* — 0, the physical states |W) are invariant. 

2. For transformations g(x) which go to a constant element g.. which is not 
the identity, or for constant g over all space, 


6|W) = iQ°O° |W) (10.52) 
This result can also be restated as follows. Define 


(ope {set of all g(x) such that g(x) — 1 as |x| co} 
C= {set of all g(x) such that g(x) — constant element of G, 


not necessarily 1, as |x| > co} 


The results we have obtained above then amount to saying that the physical 
states are invariant (and not just covariant) under G,. Thus G, is the “true 
gauge symmetry” of the theory in the sense that its elements represent un- 
physical, and hence redundant, variables in the theory. Since the elements of 
G go to a constant, not necessarily the identity, at spatial infinity, we have 


G/G. ~ set of constant g’s ~G 
G/G, ~ G is the Noether symmetry of the theory defined by the charges. 


10.4 Functional quantization of gauge theories 


In Chapter 8, we showed that for scalar fields and fermions, we had a simple 
prescription to obtain the generating functional as a functional integral, viz., 
given the classical Euclidean action Sg one could obtain Z[J, 7, n] as 


ZelJ, 7,0] =N / du (y, , W) exp [Sele ~~) + / d‘x (Jp + iw + in| 
(10.53) 
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where dy is an integration measure on the space of configurations, that is, on 
C= {9(a),U(@),8(@) | (2): M > R,U(2), B(2):-M > N} 


where R denotes real numbers and N denotes Grassmann numbers. (The 
specific measure we used for defining du was the Euclidean metric on C.) 

The functional integral for QED was also obtained in Chapter 8, but it 
did not use the classical Maxwell action, but rather a modified version of it, 
which had to do with gauge-fixing. We shall now show that the prescription 
used for the scalar and fermion fields will apply to gauge fields as well, with 
the appropriate definition of the configuration space and its volume measure. 

We start by showing that the naive definition of the integration over all 
A, does not work. Let us start with the Euclidean Maxwell action 


1 1 
—_ ns Fy F* = 5 / AM (1 by» + 0,0,) AY 


4 
it 
=5 ch AM(2)Myv (2,9) A” (y) (10.54) 
zyy 
where 
Myr(z,y) = (—O du + O,0,) 5) (x—y) (10.55) 


If we had a scalar field, we would write 


z= | \dglexp (-5 | ome+ 30) 


= (et ae exp € / mus) (10.56) 


M~! would then be the propagator. In the case of the Maxwell action though, 
we cannot do this since M,,(x,y) has zero modes, i.e., det M = 0 and M~ 
does not exist. The existence of zero modes is due to gauge invariance. Under 
a gauge transformation, A,, > A, + 0,0. For a mode of the form ¢, = 0,0 
we find 


[ Muv(e. wo") = (11 by + 9,9- 4) =0 (10.57) 


In other words, ¢, = 0,9 is an eigenvector of M,,,(z, y) with eigenvalue zero. 
We must remove such gauge or unphysical degrees of freedom from the A,,’s 
to define a proper functional integral. 

In order to understand this better we must define the configuration space 
more carefully. We shall discuss gauge theories in general from now on, since 
the formalism is essentially the same for QED and other gauge theories. First 
we define the function space A, which is the space of all gauge potentials. 
This will be the set of all four-vector-valued functions A,,(z) which are also 
elements of the Lie algebra. Existence of certain integrals like f F? will be 
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assumed. When we write [dA] in a functional integral, we are writing a mea- 
sure on A defined by [],,,, dAj,(£). This corresponds to the volume defined 
by a Euclidean distance function or metric on A given by 


oe 
| 5A |2= / dr 6A9()6A9(2) (10.58) 
a 
The space A has the property that any two points A1,(z) and Ao,(z) 
can be connected by a straight line, which is in fact given by 


A, (7,2) = 7A; (2) + (1 — TY Ao, (2) (10.59) 


for 0 <7 < 1. Notice that A,,(7, 2) transforms, for all 7, as a gauge potential 
should, viz., A9 (7,2) =7 A{,(x) + (1—7) A3,,(z) for g = enue” A is thus 
an affine space; i.e., we can write any configuration A, as A,(z) = A) +S 
where Al?) is a fixed potential and €,,(x) is a Lie-algebra-valued vector field. 

In the previous section we have defined the set of gauge transformations 
G, and showed that not only the action, but the wave functions are invari- 
ant under G,. The space G, was defined in terms of gauge transformations 
at a fixed time as is appropriate for the discussion of states and operators 
acting on them. For the functional integral, we can generalize these ideas to 
a four-dimensional setting. We will use the same notation, G,, to denote the 
set of gauge transformations which go to the identity at large values of the 
Euclidean radius, i.e., as /z#xz" -— > oo. Physical configurations over which 
we must integrate are defined on A/G,. Any two potentials which differ by a 
gauge transformation, that is, by an element of G,, are the same physically 
and correspond to the same point in A/G,. In other words, the true space of 
physical configurations on R*4 is not A but 


C=A/G. _ (10.60) 


Given the configuration space C of (10.60), we can say that the correct 
functional integral for a gauge theory is given by 


Z= / du(A/Ge) e Serf Ian (10.61) 


where Sz is the classical Euclidean action and du(A/G,) is to be obtained 
from the metric (10.58) by factoring out the action of gauge transformations. 

While (10.61) is indeed the correct prescription, rarely can one evaluate 
the measure du(A/G,.) exactly. (This can be done for gauge fields in two 
dimensions.) One way to obtain the measure is the following. (There are 
many caveats which must be stated regarding the discussion which follows, 
having to do with global properties of various spaces inolved. We shall ignore 
them for the moment, they will be discussed separately.) Consider A,(z), 
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which is a point in A. Under a gauge transformation A, — Aj = gAug | — 
0,991, which corresponds to a different point in A. If we consider a sequence 
of transformations starting from g = 1, we see that gauge transformations 
generate a flow in A. In order to get rid of gauge degrees of freedom, we 
can choose a surface &’ which cuts these flow lines transversally. (A “good” 
surface 2’ should cut each flow line once and only once.) We can then pick 
the points on » as representatives of A/G,. The choice of X’ is specified by 
a condition on the potentials called a gauge-fixing condition. For example, 
all A’s which obey 0,,A°“ = 0 lie on some surface Y’g.4, all A’s which obey 
f°(A) = 0 lie on &y, etc. If the integrand is gauge-invariant, as is the case 
for the classical action, then the choice of &’ does not matter, since one can 
get from one 2’ to another by a gauge transformation. 

We now want to write du(A/G.). By definition of A/G,, we have, at least 
locally in this space, A ~ (A/G..) x G,. Thus 


[dA] = du (A/G.) du(G.) (10.62) 


For gauge transformations which are close to the identity, namely, g(x) ~ 
1 — it*@°(x), we can write du(G,) ~ [d0°(x)|. Let the gauge-fixing condition 
be 

f(A=t@)=0 (10.63) 


where h°(x) has no A-dependence. Multiplying both sides by a common 
factor, we can then write (10.62) as 


Of*(x) 


[dA] ace Fal = du(A/G.) [49] ace Sac | 


(10.64) 


The determinant of 
p(x, y) = 5f*(2)/66"(y) (10.65) 


is known as the Faddev-Popov determinant. We now multiply (10.64) by 
exp (—Sz(A) + f J%A2) 6[f(A) — A], where the 6-function is a functional 
6-function, and integrate. We obtain 


[lal s15(4)— m1 [det(Arr)lew (-Se(4) + f 7048) 


2 / du(A/Gx) exp (-se(4) + / sag 
x {{d6] |det(App)| 5[f — h]} 
= / dp(A/Gx) exp (-se(4) + ih rae (10.66) 


where we have used the identity 


iA (d6] \det(Arp)| df — h] = if (df] of — bh] =1 (10.67) 
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for integration along the gauge directions. The right-hand side of the (10.66) is 
our definition (10.61) of the functional integral over the proper configuration 
space. We thus have 


ZA / [dA] 6[f(A)—h] |det(Arp)|exp (-se(4) + sr rns.) (10.68) 


This gives a formula for Z;,[J] written conveniently in terms of the Euclidean 
measure [dA]; the redundancy is factored out by use of the 6-function. Strictly 
speaking, Z,[J] depends on the gauge-fixing condition, as indicated by the 
subscript. Writing the measure du(A/G,) in terms of A’s on the f —h = 0 
surface is correct only if the integrand is gauge-invariant. The J°” A/-term is 
not and thus Z;,[J] will depend on the gauge-fixing condition. Of course, Z[0] 
is gauge-invariant. The implication is that the Green’s functions defined by 
Zn|J] will be gauge-dependent in general. However, the S-matrix elements 
are not and so we can simplify (10.68) even further as follows. Since A is 
arbitrary, we can define the S-matrix using Z, , Zp or f [dh|F(h) Zp; these 
are all equivalent. We can thus define, for the purposes of computing the 
S-matrix, another Z by choosing F(h) = exp(—s235 f h?), where @ is just a 
real number. 


Z| = i (dA) 6b (A) ligase enn (-Se(A) + / Fig.) 


antes (- — [ H(c)) 


= [aA] |aet(Aep)lexp (-se(4) - Sigs | sonas_ 
(10.69) 


The 6-function has been removed in favor of a term f? which can be consid- 
ered as an extra term in the action. We can now make one more improvement 
which is helpful in perturbative calculations. From our discussions of Grass- 
mann integration 


‘i [dQdQ] ef °M@ — det (10.70) 


for independent Grassmann variables Q(x), Q(x). We may thus write 


ldet(App)| = [ idede exp (- [ (Fee) (10.71) 


We have taken the determinant to be positive, which it is in the region around 
the classical vacuum A, = 0. (Eventually the formula we obtain will not be 
applicable as it is for nonperturbative effects anyway, so that consideration 
of the region around A, = 0 is not too restrictive.) This provides a conve- 
nient way of writing the determinant as part of the action with additional 
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(Lie algebra-valued) fields c? and é, which is useful in writing down the 
rules for Feynman diagrams in perturbation theory. The fields c¢and é are 
Grassmann-valued, yet they are not spinors, but scalars. Thus they have the 
wrong spin-statistics connection. They are often referred to as the Faddeev- 
Popov ghosts. The wrong spin-statistics relation for the ghosts is not a prob- 
lem, because they do not appear in the external lines or as asymptotic states 
in the theory; they are merely a device to write the determinant in a conve- 
nient way. Using (10.71) in (10.69), we finally get 


Z\J) = i enue euep (-s.+ / Fm (10.72) 
whet 
Seeeows, + ; thee | saz h (A)ie(A) +20) (sae) | (10.73) 


The final prescription for the functional integral in a gauge theory is then 
quite simple. We have to choose a gauge-fixing function f(A) (which, by 
definition, cannot be gauge-invariant) and then construct S, by adding the 
f?-term to the classical action and also adding the Faddeev-Popov ghost 
term. Integration of e~*? with the standard Euclidean measure for ‘Ay, ¢ and 
é will then give the functional integral for Z[J]. 

We must now ask the question: what is a “good” gauge-fixing? As we 
remarked earlier, a good gauge-fixing must produce a surface which intersects 
each gauge flow line transversally once and only once. The existence of such 
a surface depends on the global properties of A/G,. If we choose a surface 
which is not transversal, then there is a direction of gauge variation which 
would not change f*. This means that the matrix A%’,(z, y) has zero modes 
and that the choice of f* does not fix the gauge completely. This can be 
taken care of by choosing a different function, at least locally in A around 
the point A, = 0, such that det Arp F 0. 

A more involved problem has to do with the fact that the gauge-fixing 
surface may intersect flow lines more than once. In this case, there are gauge- 
equivalent configurations on the gauge-fixing surface or there are nontrivial 
solutions g(x) to the condition 


f2(A2) —h=0 (10.74) 


(In this case, A and A9 lie on the same gauge-fixing surface.) In other words, 
there does not exist, globally over the configuration space, a surface which 
intersects the gauge flow lines once and only once. Integrating over all points 
on the chosen surface leads to overcounting of the degrees of freedom. This 
problem is known as the Gribov ambiguity. Since the gauge-flow line has to, 
roughly speaking, turn around to come back and intersect the gauge-fixing 
surface, we see that 6f/69 must vanish for some A along the flow line; the 
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Faddeev-Popov operator Arp once again has a zero mode at that point. The 
Gribov ambiguity is easily avoided for an Abelian gauge theory such as QED. 
However, it is unavoidable for all smooth gauge-fixings in a nonabelian gauge 
theory, due to the global properties of A and A/G,. (This will be discussed 
in some more detail in Chapter 14.) If we consider a gauge-fixing condition 
like 0,,A°% = 0, we see that Arp is positive at A, = 0 and so continues 
to be positive for a range of A’s which is perturbatively accessible around 
A, = 0. The Gribov problem does not affect perturbative calculations with 
our functional integral (10.72, 10.73). 


10.5 Examples 
1. Electrodynamics with f(A) = 0,A” 


If we choose f(A) = 0, A", Arp = —01,6 (x — y). We then find 


flies... 
Sy = [| AiwPH + s-0-AP +a-Die] (10.75) 
For the choice a = 1 (and scaling A — eA and adding on the fermion terms) 
we get our earlier expression for the QED functional integral (8.57, 8.58). 
The c, ¢-dependent term just gives a constant multiplicative factor det(—D) 
which can be absorbed into the normalization constant. The limit a — 0 is 
known as the Landau gauge. 

This derivation of the functional integral for QED justifies the use of the 
covariant propagator for the photon in calculating the S-matrix. 


2. Electrodynamics with f(A) = 0,A" + A,A¥ 


A simple example of a nonlinear gauge in electrodynamics is given by 
f(A) = 0, A" + A,,A“. In this case, we find App = —(0, +2A-0)56 (x —y), 


giving 

Sy= [| |\AFuwF + —(0-At AP +e 

a= | \qakwF™ + 5a (0 A+ A)’ +e-O1-2A4-d)c} (10.76) 
The ghosts now interact with the photons and cannot be ignored. 


3. Nonabelian gauge theory with f* = 0,,A% 


In this case, with f* = 0- A*, we find A%,(z, y) = —(0,D*)"56 (a — 
y) where D,, is the covariant derivative (in the adjoint representation). S, 
becomes 


if 1 
S, = / aah Fou 4 gael? : Ase ae dc (0,.c° ae a) (10.77) 
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Choosing a = 1 and rescaling A — eA, 
OF = So + Sint 
1 
oo = i [5a.azonas + 0°70,,c° 


, 2 
Sex = ‘i lersta,azabn ar fe ae a del ee eye Oe 
(10.78) 


In this case, even though the gauge-fixing condition is linear in the gauge 
fields, the ghosts are unavoidable since they interact with the gauge fields. 
The rules for Feynman diagrams can be read off from (10.78). In particular, 
the (Euclidean) propagators for the gauge field and the ghost field are given 
by 


A 
(A2(x)A°(y)) = 6 oe ak 1 eik(e—y) 
be v ye (Ont 


atk 1 key) 


(27 Fake 


(eo (aan) yad:> (10.79) 


10.6 BRST symmetry and physical states 


In this section, we shall discuss some aspects of the operator formulation 
of the quantum theory of gauge fields. The functional integral involves a 
modified action as given in (10.73). A canonical operator quantization of 
this modified action may be carried out to obtain the states of the theory. 
An elegant way to understand the physical states and the elimination of 
gauge degrees of freedom in this approach is in terms of the so-called BRST 
symmetry, named after Becchi, Rouet, Stora, and Tyutin. In order to display 
this, we go back a step and write the action (10.73) as 


ae) 0) (10.80) 


where B*(z) is an auxiliary field. If we eliminate it by its equation of motion, 
namely, 


Sp=SB(A) +f fipese + Sew + 2°(2)( 


ae*B* = -if* (10.81) 
or if we integrate over B® in a functional integral with a standard Euclidean 


measure [dB], we get back the action (10.73). 
We now introduce the BRST transformation by 


Q At, = (Duc)® 


Oc= — $3 frcPce 


Q B*=0 (10.82) 
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The change in Aj, involves the ghost field; Q is Grassmann-valued and the 
parameters of me transformation must thus be considered as Grassmann 
variables. The transformations (10.82) are easy to write down. For the gauge 
field Av, we do an infinitesimal gauge transformation AZ; > Aj, — (Dah)? 

and replies the gauge parameter —0* by the ghost field cA This “— means 
that gauge-invariant quantities are BRST-invariant. Since all observables in 
a gauge theory must be gauge-invariant, we see that all observables are given 
by BRST-invariant expressions. Once the action of Q on Af is specified, the 
action of Q on c®, as given in (10.82), is determined by eeeiaieng O* =a. 
This can be emaaly checked as follows. 


Qo? AG ae 8,,(Qe*)+- f2-(O,c?)c° + f2° A? (Qc°) 
+ fe ge Anclc® ° (10.83) 
Choosing Qc® + ; f2¢chc® = 0 and writing 
f° (0,0 )6 = 10,,( feet’), 10.84 
be 2°u 
which follows from the Grassmann nature of the ghost fields, we get 
OQ? AS = ae PCAN Fckl cho ae fees 
—4 Pi aac og = leh ab ~ co Ava 
= 0 (10.85) 
where we have used the Jacobi identity on the structure constants, viz., 
i Rede toa a faye a6 frye? 0 (10.86) 
The Jacobi identity follows from the Lie algebra identity or matrix identity 
eel + eee + eee = 0 (10.87) 


upon using the commutation rules (10.10). One can also check, by using the 
Jacobi identity, that Q? is zero on the ghost field c* as well, with the choice 
of the transformation rule Qc* + A f2¢c’ce = 0. For the antighost field 2°, 
the action of Q should produce a bosonic field; we can simply define this as 
iB*. O B® is then taken to be zero, so that Q? = 0 on @. Q? is trivially zero 
on B® itself. 

With the BRST transformations (10.82) we can write 


=a a EGE on a * Da ae? apa aa ie b 
oe(A) - i oR] =iB fl + — BSB +e (55) (10.88) 


The modified action (10.80) to be used in the functional integral can thus be 
written as 
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S,= Se(A) + | ise + fo SS pope + 24() (Gay) o@)| 
nA OC / [e (1259) (10.89) 


Since the action of Q on A’, is an infinitesimal gauge transformation with 
parameter —c* and Sz is gauge-invariant, we have 


QS,=0 (10.90) 


using QO? = 0. This BRST-invariance expresses, for the gauge-fixed action Sa 
the effect of the gauge-invariance of the theory. 

The BRST-invariance leads to a conserved current and charge, which 
we now derive. For simplicity, consider the gauge choice f* = 0- A*. The 
Lagrangian for S, can be written, in Minkowski space, as 


if * 4 ae? 
L= “sal +i BO" Al, + 042° (Dyc)* + a (10.91) 
The variation of this Lagrangian is 
1 P 
dL = 0H ~ aa OAL + iB*5Al, + 60°(Dyc)* + 0,676" 
+ equations of motion (10.92) 


Under a BRST transformation, £ in (10.91) changes as 6£ = 0,K¥" = 
0,,{iB°(D*c)*]. using the general formula for Noether currents from Chapter 
3 and the transformations (10.82), we can identify the current as 


1 ; a 
Jy = — =F" (Dee) + iB*(Dyc)* + 5 f2°0,ec8c° (10.93) 
The Grassmann nature of the variations is important for this; it is useful 
to introduce a Grassmann-valued parameter for the variations due to Q and 


then identify the current after moving this parameter to the left end of all 
terms in 6£. The charge corresponding to (10.93) is 


G= je E F@.(Dic)® +iB*(Doc)* + $f2°Ope%c a (10.94) 
(We use Q for the canonical charge, Q for the BRST variation in the func- 
tional language.) From the Lagrangian (10.91), the canonical one-form is 
given by 

e= pes | Fabs + iB°5A§ + 6é*(Doc)* + anc*se*| (10.95) 


In addition to the commutation rules for the A?, Fy; given in (10.45), © leads 
to the following nontrivial commutation rules. 
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[AG (w, t), iB°(y, t)] = 1676 (« — y) 
{c%(a, t), o@(y, t)} = 16°°5) (a — y) 


{c°(a, t), (Doc)’(y, t)} aa — 1605) (x = y) (10.96) 

One can then see immediately that Q defined by (10.94) is the canonical 
generator of the BRST transformations, i.e., es Se 

ypQ+tQnp=-ibdp — (10.97) 


where the plus sign or anticommutator applies to the fermionic fields y = ¢,c 
and the minus sign or commutator applies to the bosonic fields y = A,, B. 
One can check that Q? = 0 in the canonical version as well. 

In quantizing the theory, we must split the various fields into the cre- 
ation and annihilation pieces (or negative- and positive-frequency pieces). 
The states can then be built up by the acting on the vacuum state many 
times by the creation parts, or the negative-frequency parts, of the fields. 
States containing ghosts would be obtained by applying c*‘~) many times. 
In this context, the ghost number is a very useful concept. The Lagrangian is 
invariant under c — e’Xc, € — e~*Xé for some constant ~. The corresponding 
charge is 


Qon = f da [-2*(Doc)® + doe" (10.98) 


Consider now the case of electrodynamics again. In this case, the BRST 
transformation of the ghost is zero, Qc = 0. The ghost number simplifies to 


Qon = f dx [-2*(Go0)" + deere") (10.99) 


We now show that the physical states |W) of electrodynamics can be spec- 
ified by the conditions 


Q |W) =0 
Qgn |7) = 0 (10.100) 


We shall also see that states |W) and |W’) = |W)+Q |) are equivalent, so that 
we may restrict to states which are not of the form Q |A). In other words, 
physical states are annihilated by Q and Q,,, but are themselves not of the 
form Q |A) for some state |). (Notice that any state of the form Q])) will 
be annihilated by Q since Q? = 0.) Introduce a field € defined by Q & = c. 
The field € is the part of A; which is not gauge-invariant; i.e., we can write 
A; = Aj, + 0;£, where AT obeys 0;A7 = 0. We then have a “quartet” of 
fields €, B, ¢, c, which we can call “unphysical” fields. The application of 
the negative-frequency parts of these operators on any state will generate a 
set of states with “unphysical” particles. The fields 0;Fo;, Ao, Ooc and Oz 
are related to the canonical conjugates of the quartet and are thus related 
to the positive-frequency parts of the quartet operators. Our argument will 
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be recursive, where we start with a state with a given number of “unphys- 
ical” particles and construct a state with more “unphysical” particles and 
require it to obey the conditions (10.100). Let |W) be a state with N of these 
“unphysical” particles. We can write 


Iw) = (BO Or +4 € Oo +&-) 05 4 cf?) Os) Ic) (10.101) 


where the ©; are some operators and |c) has less number of “unphysical” 
particles. We can also assume that |c) obeys the conditions (10.100). We 
now require |W) to obey the same conditions (10.100), i.e., 


Q |) = | B™ (QO1) + Oe + E- (QO2) + iBO 03 


— e-)(QO3) — c—)(QOa)| |c) 


=0 (10.102) 


Here QO denotes the change in O due to the BRST transformation; it is thus 
the commutator or anticommutator, appropriately, of Q with O. Equation 
(10.102) requires in general 


Q O71 = -103, Q O3 =0 
Q O4 = Or, Q O2=0 (10.103) 
The solution to these equations may be written as 
O,= 00+ pi, O03 =icO 
O4 = &0 + pa, O2 =i BO (10.104) 


This is the choice consistent with zero ghost number. Here O, O are operators 
of zero ghost number and are BRST-invariant; p;, p4 are also BRST-invariant 
operators. Using this solution 


\D) = (sae 7 ig) O+ (OBO 4 et) 6 Io) 
+(BO) p, +e pg) |o) (10.105) 
Since BOE) + iO) = Q(-ie€) and (BOY p, + c&-Yp4) = 
Q(-ie p; + € pa), we get 
|W) =Q |r) (10.106) 


for some state |). Thus, all states which have “unphysical particles” and 
obey the conditions (10.100), by recursion of the above argument, are of the 
form Q |A). Since Q is a self-adjoint operator, these are also zero-norm states, 
i.€., 
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ll Q 1A) l= (AlQ?|A) = 0 (10.107) 


These zero-norm states do not contribute to the matrix elements between 
physical states of any operator which is BRST-invariant, i.e., 


(H|OQ |A) = (i1Q O|A) = 0 (10.108) 


since Q|W) = 0. In particular they do not contribute to the S-matrix which 
is built up by time-evolution by the Hamiltonian. The Hamiltonian is BRST- 
invariant and hence time-evolution will preserve the BRST-invariance of the 
physical states. We also see that |W) and |W’) = |W) + @ |A) are equivalent. 
(Actually going from |W) to |W’) is the Hilbert space version of a gauge or 
BRST transformation.) We may thus restrict our analysis to states which 
obey the conditions (10.100), but which are not of the form Q |A) for some 
|\). By the recursion argument above, the only states of this kind must be 
states with no “unphysical” particles. They are built up using BRST-invariant 
operators which include gauge-invariant operators as well. Since Ar are Q- 
invariant, the general solution to (10.100) is of the form 


|W) = |Wr) + Q |A) (10.109) 


We have thus shown that states of transverse photons obey the conditions 
(10.100) and are not themselves of the form Q |A). In other words, each 
physical state belongs to a class of |W)’s which obey the conditions (10.100) 
with the equivalence relation |W) ~ |W) +@Q |A). (One may say that the 
physical states are cohomology classes of the BRST operator.) 

This description generalizes to the nonabelian case also, although the 
construction of the recursion argument is algebraically more complicated. 


10.7 Ward-Takahashi identities for O-symmetry 


We have shown that the physical states obey the condition Q |W) = 0. The 
Green’s functions of interest in the theory are the N-point functions for 
BRST-invariant operators. The S-matrix for physical states may be obtained 
from such Green’s functions. The symmetry translates into a set of identities 
for the Green’s functions, the Ward-Takahashi (WT) identities, which we 
now derive. These identities are crucial in showing that matrix elements of 
physical observables are independent of the specific gauge-fixing condition. 

The action to be used in the functional integral, namely (10.80), satisfies 
Q S, = 0. A representation of Q as an operator on functionals of the fields 
in four-dimensional space is given by 


Q= [ae Ce ma 


Consider now the expectation value of a function of fields and derivatives, 
say, O, which is not necessarily Q-invariant. 


Al 
= fol oe +iB a (10.110) 


2 0) de® 


mo 
7; 
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(0) = ‘i [dAded2] O(A, 0,2) exp (—S,(A,¢,2) 


= [laatactae' O(A’',c',@) exp(—S,(A’, c’, 2) 
(10.111) 


where we just renamed the variables of integration in the second expression. 
We now take the primed variables to be the unprimed ones plus Q-variations, 
i.e., 


At’ = At +w Q Aa 


C=C Sune" 
e=F+yVQe (10.112) 


where w is an arbitrary Grassmann number. If the measure of integration is 
invariant, which it should be for consistency of the theory, we get 


(0) = / [dAdcdé] (O + 60) exp (—S_ — 5S;) 
= (0) + (60) (10.113) 


since 6S, = 0. 60 = (Q ©) is the BRST variation of the operator O. (On 
the question of the invariance of the measure, see the chapters on anomalies. ) 
We can thus write the above equation as 


(010) =a (10.114) 


We now consider the Green’s functions 


G(x, 21, 22,...,2N) = | leAdeae O(z) [] os) exp (—S,) (10.115) 


where ©; are Q-invariant. Following similar arguments to what was given 
above, from (10.111) to (10.114), we find 


(20) TT Ole) = 0 (10.116) 


This is the basic WT identity for the Q-symmetry. An immediate consequence 
is that the S-matrix and matrix elements of gauge-invariant operators are 
independent of the gauge-fixing condition. For instance, let f; and fz be two 
different choices for the gauge-fixing in S,. We then have 


Soe / dc © lat: — fa) = O Vie (10.117) 


The Green’s function for a number of BRST-invariant operators O;(2;) cal- 
culated with the gauge-fixing f, can be written as 
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G1(21, Z2,...,2N) = [ew [[ oes) 
= yreeme Vi2 [[ Ox(2:) 
= f een [[ O:(es) + [ee Ri2) [| ses) 


= Go(21, 22, .--)2N) (10.118) 


where we have used the identity (10.116) in the last step and the fact that 
exp(—QVi2) = 1+ QO Rio, Riz = —Vi2 + $Vi2QVi2 + --:, which follows 
from Q? = 0. Eq.(10.118) shows that the N-point functions for Q-invariant 
operators, and in particular gauge-invariant operators, are independent of 
the gauge choice. The S-matrix elements, which are constructed from such 
Green’s functions, are also independent of the gauge choice. The identities 
for the change of the Green’s functions under a change of the gauge-fixing 
condition are known as the Slavnov-Taylor identities. They were originally 
expressed in a slightly different form. 

The WT identities and the result that Green’s functions of Q-invariant 
operators are independent of the gauge choice are somewhat formal, as we 
have derived them. In actual calculations, there may be divergences of inte- 
grals which require regularization and suitable subtractions in order to make 
the theory well-defined. If the regulator does not preserve O-symmetry, we 
may lose some of these results. There are three possibilities. 


1) There exists a regulator which respects Q-symmetry and this is the one 
used in the calculations. 

In this case, the WT identities hold and the above arguments for the 
gauge-independence of the S-matrix are true in the regularized, and ulti- 
mately, the renormalized theory. (Symmetries other than the Q-symmetry 
may be lost due to this regularization.) 


2) There exists a regulator which respects Q-symmetry, but a different reg- 
ulator which does not have manifest Q-symmetry is used. 

In this case, the regularized theory does not have Q-symmetry; in par- 
ticular, there are terms which are not Q-invariant with potentially divergent 
coefficients. Counterterms to be subtracted do not have Q-invariance, but it 
is possible to choose them in such a way that the renormalized theory has 
Q-invariance. The S-matrix is again independent of the gauge choice. (It may 
not be possible to choose counterterms so as to preserve other symmetries as 
well, they may be lost due to regularization.) 


3) There exists no regulator which preserves the Q-symmetry. 

In this case, Q-symmetry is definitely lost in the renormalized theory 
and we say that the Q-symmetry is anomalous. “Unphysical” states can con- 
tribute to matrix elements of observables and the theory loses unitarity in 
general. In order to have consistent theories, we must therefore eliminate pos- 
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sible sources of anomalies in the Q-symmetry. This can lead to constraints 
on the allowed matter content of a gauge theory. 

The WT identity for BRST-invariance can also be expressed in terms of 
the generating functional I for one-particle irreducible vertices. In this case, 
it is convenient to consider the functional 


eV — [laAdedeas) exp (-s, 7 om + HC? + Cn 


1 
+(Dyc)?K% — = | ada oal ove Tue ree) (10.119) 


W is a functional of the source functions J“, 7, 7, K, L, and J. I is now 
defined as 


A, c,é, K,L, JJ}= / JH AS + A207 + 2n* — W (10.120) 


Here we do a Legendre transformation for J*“, n* and 77, but keep the other 
sources. The WT identity is given by 


0= [leAdedeas) OQ exp (-s + | i cae Ml 


1 
+(D,c)?*K%# — 5 freed L* + re) (10.121) 


This can be written as 
1 
( i J**(Dyc)® + 5 fre'e® + iBin*) = 0 (10.122) 


While we can write J*, 7%, 7° in terms of derivatives of I’ with respect to 
the fields, we are still left with the expectation values of the operators B, D,,c, 
and f%¢c’c°. The extra sources help to write these in terms of derivatives. 
This was why they were introduced into the functional integral. We get 


[as [2 sc OF OF Or OF 


LE ee 12 
5Ag 6K 5a bea “5 y*5a8| ~” Se. 


This is the basic identity; by functionally differentiating with respect to var- 
ious fields and sources and setting them to zero, we get an infinite set of 
relations among the vertex functions. 


10.8 Renormalization of nonabelian theories 


We now consider some aspects of the renormalization of nonabelian gauge 
theories. For the choice f* = 0+ A® and with a = 1, the action (10.77) can 
be written as 
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s= i dbz jar 8,.A2)(aH AMY — 8” A%) 
ae 5 f7(OWAS = a, Aa) A AY ——— eo fie poremtin A! AraAS 
+ B24? +e f2O, co AMC + (0 . Ar)? (10.124) 


We have scaled the potential Av, — eAf, which is convenient for perturbation 
theory. All the operators involved i in he action are of dimension 4. The prop- 
agators behave like k~? as expected and power-counting arguments would 
show that potential divergences can only generate local terms of dimension 
4. In order to establish renormalizability of the theory, one needs the follow- 
ing. 

1) A subtraction procedure, essentially Bogolyubov’s recursion formula, to 
show that naive power-counting arguments do work. 


2) The WT identities for gauge- or BRST-invariance which would show that 
of all possible dimension 4 terms that one can write down, only those with 
the tensor and group structures indicated in (10.124) do arise with diver- 
gent coefficients, so that the divergences can be absorbed by redefining the 
coefficients of the various composite operator terms in (10.124). 

The WT identities can be maintained by using a gauge-invariant regular- 
ization such as dimensional regularization. One must establish rules for power 
counting, separating overlapping divergences, etc., in such a regularization so 
that it is evident that only dimension 4 terms are generated with potentially 
divergent coefficients. If these conditions are satisfied, we can do calculations 
in a nonabelian theory with the action 


Sq = fae |5 


28 (9, At — 8,,A2)(B"A™ — 8” A%*) 


+o = furr(g,, A= A%) APH Ac” 
oh abc ramn abu acv am an 
oA gabe gamn Abu Aev Am An + 7.6),c%9 Mc 
. Ab ce +30 9. A)? (10.125) 


where we naively put arbitrary renormalization factors for all composite oper- 
ator terms. This can be split into a “classical part” S given by S, of (10.125) 
with: 2; = Z3 = Fas A= Ma = = 1, and a counterterm action S, — S. 
The counterterm action is defined as = is needed to cancel eer di- 
vergences. If we have a regulator which respects the BRST-invariance, the 
possible divergences and hence the counterterm action will have this prop- 
erty. This can be seen recursively starting from the “classical action” S and 
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fixing counterms loop by loop. Thus if we are using a regulator which respects 
the BRST-invariance, one can restrict the form of the counterterm action to 
one which has BRST-invariance. The WT identities or BRST invariance will 
then imply that, of the six renormalization factors Z;, Z3, Za, Fi, AM » 
only four are independent; i.e., one has the relation 


’ 


Z4 A 4 


== 10.126 
Z\ 23 Z3 ( ) 
A simple way to see how these relations arise is to define the BRST transfor- 


mations 
Q Al, = Oye? + af Al c® 
Oc= ST 
Qc =70-A* (10.127) 


and require Q-invariance of the action (10.125). This will give aZ3 = eZ;, 
aZ; = eZ, and aZ3 = eZ, leading to (10.126). Notice that f%°A®c° and 
f2c’c® are composite operator terms in the transformations and as such, 
they do, in principle, need renormalization factors, which we have denoted 
by a, b. (They should not be confused with the Lie algebra indices.) From 
Q? = 0 on A®, 6 will turn out to be equal to a. We cannot impose Q? = 0 
on the antighost field ¢* since we have eliminated the B-field by its equation 
of motion. Thus Q? will be zero on @ only upon using the ghost equation of 
motion. The Q-invariance of the action (10.125) will further give 7 Z3 = A. 

Gauge theories in four dimensions, with or without spontaneous symme- 
try breaking, are renormalizable. If one breaks gauge invariance explicitly by 
adding a mass term $m A? to the Lagrangian, then we can lose renormal- 
izability or unitarity. For example, for a massive Abelian vector particle we 
have 


2 
L= 7 (uAv — 0,A,,)(O" A” — 0” A*) + = A + matter terms 


(10.128) 
The propagator D,,,(k) for A,, obeys the equation 
[(h? +m?) bya — Kuke] Dow = Suv (10.129) 
which has the solution 
Dy (K) = Set hak im’) (10.130) 


k2 +m? 


At large |k|, this behaves like k,k,/k? and so the ordinary rules of power 
counting break down. One gets divergences corresponding to operators of ar- 
bitrarily high dimension. The equations of motion for the Lagrangian (10.128) 
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have a constraint 0: A = 0, which can help to get rid of negative norm states. 
Alternatively, if one considers £ = +A, (—D)A¥ + 5m? A, A", the propagator 
is well-behaved at large |k|, but the Ap-component does not decouple and 
leads to problems with unitarity. The combination of the good features of 
these two cases above, viz., having enough gauge freedom to eliminate pos- 
sible negative norm states and the high-energy behavior of the propagator 
being k~?, occurs for spontaneously broken gauge theories. 


10.9 The fermionic action and QED again 


The BRST argument leading to relations among the renormalization con- 
stants can be extended to the case when fermions are coupled to the gauge 
field as well. The transformations on the fermion fields are of the form 


Qw = iac*(t?y) 
Qb = ia(pt*)c* (10.131) 


(A priori, we could have a different constant, say, 6, in these equations in 
place of a; but Q? = 0 will show that 6 should be a.) The fermionic part of 
the action can be written as 


S= [as Za(y- 8+m)yp — teZ py Abtry (10.132) 


With a regulator which respects BRST-invariance, we can again use a re- 
stricted action where the renormalization constants are related. Imposing Q- 
invariance on the fermion action (10.132), we get Z; = Zo. We can specialize 
these results to the case of quantum electrodynamics which is a U(1) gauge 
theory. We have already seen the relation Z; = Z2 by explicit calculation at 
the one-loop level. Here we see that it will hold in general. 


10.10 The propagator and the effective charge 


As an example of one-loop calculations in a nonabelian gauge theory, we shall 
now calculate the gauge boson propagator to one-loop order and relate it to 
an effective charge. 

The basic one-loop diagrams we need are 
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Fig 10.1. One-loop corrections to gauge boson propagator 


(The dashed lines represent ghost propagators.) These diagrams will lead to 
divergences and so they have to be regulated. Although we used a straight- 
forward momentum cut-off previously, the present calculations are rather 
invloved and can be simplified significantly by choosing a gauge-invariant 
regulator such as dimensional regularization. The basic idea here is to do the 
calculations in an arbitrary dimension n = 4 — €, where € is taken to be very 
small. This is done by analytic continuation of various expressions to an ar- 
bitrary dimension. Integrals involved in Feynman digrams are well defined as 
a function of n, with poles at n = 4 corresponding to the divergences in four 
dimensions. The idea therefore is to do a Laurent expansion in ¢, identify the 
pole terms, which are then canceled by choice of Z;-factors, and then take 
the limit « — 0. Since the concept of gauge-invariance does not depend on 
the dimension, this will be an invariant regularization. We will use such a 
procedure here. 

In using dimensional regularization, we first do the algebraic simplification 
of the integrands of Feynman integrals, keeping in mind that we are in n 
dimensions when contracting indices and taking traces. For the basic type of 
integrals involved, we can carry out the integration in n dimensions to obtain 
the following formulae. 


d"p 1 _ 1 F(a—n/2) i 
(2m) (p? + M2) (4n)"/2 (a) (M2 )a-0/? 
d"p p n 1 I(a—n/2-1) ih 
| Gay GPR 72 Gay? Fe) OR 
d"p Dawe _ Ow 1 I(a—n/2-1) 1 


(20) (p2 + M2)% — oe (41)"/2 I'(a) (M2)o-n/2-1 
(10.133) 
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where I’ denotes the Eulerian gamma function. The relevant Laurent expan- 

sion can be obtained by using the property z/'(z) = (z+ 1) and 
P(e/2)=--7y+O(e) © (10.134) 

+, is the Euler-Mascheroni constant, approximately equal to 0.5772. 

Contribution of diagram 1 


By writing out (e~°!"*) = = and carrying out the necessary Wick con- 
tractions, we obtain the contribution due to the first diagram as 


Paings = "5*Ca | G(a,2) AG (2)AS(c) (10.135) 


where C2 is the quadratic Casimir invariant for the adjoint representation of 
the group. It is defined by C25°? = f°" f°™" and for the group SU(N) it is 
equal to N. 


Contribution of diagram 2 


The second of the Feynman diagrams shown arises from 


etm — a (f ctelzaeeyaamy) wf emadeatas) 
(10.136) 


Carrying out the Wick contractions, we find 


a: Wo CCe) 


Tgiag2 = o. f|-3 poeta v(y)G(z, y)G(z, y) dt Spel) ASU) ae 
_ 54s (2) Al (2)G@,z) 


Sesnaten 1) ew OED + 2m ED ae,y)} 


Ort ane Ox Ox” 
(10.137 
where fj, — O,A, — OLA’. 
Contribution of diagram 3, the ghost loop 
The ghost loop contribution is easily seen to be 
Paiogs = ‘A At (2) AB(y) es) SC E¥) (10.138) 


Contribution of the first three diagrams 
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Combining (10.135, 10.137, 10.138), we obtain 


ree, [ PS agaraganaten ~ FF), (WCl@, w)G(a4) 


4086 
+ Fila) as(y) SY Gre, y) 
sii n—2 a a OG(z, y) OG(z, y) G(x, y) 
FAN) AD) Se ae Fae cto} 


(10.139) 


where fi, = 0,A/ — 0, Aj. Each of these terms carries a factor e”, where e 
is the coupling constant. This is seen by the rescaling A — eA. However, in 
dimensions other than four, the coupling constant is not dimensionless, e? in 
n dimensions has the mass dimension 4 — n = e. We introduce a parameter 
jt with the dimensions of mass and write e? = e?:*, where e will be the cou- 
pling constant in four dimensions. The factor of yu‘ and the four-dimensional 
coupling constant e? have been explicitly indicated in (10.139). 
For the first of the integrals we find 


n—2 oe ee d"p 1 
ite (21)" p? + M?| 


it —2 2-n ar 
a ae ae ( 2 )ary ° 


— appee (5) a 
M=0 


= (10.140) 


M=0 


The integral vanishes at finite « and hence can be taken to be zero in the 
regularized theory. Naively this integral is quadratically divergent; it is a 
peculiarity of dimensional regularization that quadratic divergences do not 
appear and such integrals can be set to zero. (This does not mean that all 
consequences of quadratic divergences disappear from a dimensionally regu- 
lated theory; there are examples of dimensionally regulated theories where the 
quadratic divergences can reappear if some summation of the perturbation 
series, such as Borel summation, is attempted.) 
The second term in (10.139) is 


Term: 2 = ofa (x) Loni e *k(2—y) 77 (2) (x) 
4 pu") da (Q0)" 
d"p ii 


(2)(-) — — (a) aa 
7? (k) = (u?) (27)” p2(p — k)? 


210 10 Gauge Theories 


ay? f a Qn)” [p? + k2 oo —u)}? 
= af du P'(e/2) exp E is 7 


= a3 ? —y+2+log4xr —" iow(k?/x”) (10.141) 


The third term can be evaluated similarly and is equal to the second term 
given above. The last two terms become 


ok 
(277)” 


1(4+5)(k) = (u)¢/2(1 — €/2) [ de 


e  tk(z—y) yz (4+5) (k) 


Terms (4+ 5) = 2 | a (x) AZ (y) 


d"p —2pyupy + kykyu(l — 2u) 
(27)” [p? + k?u(1 — u)|? 


2 +) 
= (ke uv Si i) eg 9672 5 [2 Vos log 47 a log(k?/u”) as ;| 
(10.142) 


Combining terms, the one-loop contribution to the quadratic term in the 
effective action becomes 


1 


a —tk(r— 
T= 4 oe nv (@) fav (ye (2-v) IT(k) 
a e e?C2 wy =a 2 oa 5 | 
Hk \e== Ten? E (2 —7+ log 4a log(k* / )) + 9 


(10.143) 


Contribution of fermion loops 


We also consider N¢ species of fermions in the fundamental representation 
of the group coupled to the gauge fields. The fermion action is given by 
S = qly:(0+A)+mlq, where A, = —it* A‘. t* are matrices in the fundamental 
representation and are a basis of the Lie algebra. For SU(N), they can be 
taken as traceless hermitian (N x N)-matrices. We have Tr(t*t?) = $6%°. The 
y-matrices obey Try” = 2”/25”. The one-loop contribution to the term 
in I’ which is quadratic in the A’s is 


il 


Fas]  Ai@)Aiy) oe mT) 
x,y,k 


i 
Hulk) Sean mata | du 
0 


, i d"p yup + (Suvk? — 2kyky)u(l — u) — du,(p? + m?) 
(27)” [p? + k?u(1 — u)]? 
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boa kuky) 
an 2N ( pv ly i a 
sare ae é du u(1 — u) 
k?u(1l —u) + “) 


2 (10.144) 


2 
x [2 —-+ 10825 — le ( 


With this explicit formula for the integral, the fermionic contribution to I" 
can then be simplified as 


. _ te Is (k) 


II;(k) = ? —~y+log2m — log pn ee] 


Tes 
(10.145) 


The effective action to the first order in hf also has a contribution from 
the Z3-factor in (10.125). The quadratic term in I’, to first order in h, is thus 


aap feladiplve Me Vin 
V(k) =Z3 + Tw) =F IT(k) (10.146) 


It is clear that one can choose Z3 so as to cancel the 1/e-term, so that the 
limit n — 4 can be taken without divergences. This shows the renormalization 
of the gauge-particle propagator to one loop order. It is possible to absorb 
some of the constants, namely, k-independent terms, in Z3 as well. Exactly 
how Z3 is defined is a choice of the renormalization scheme. Notice that the 
differences can be taken to be different definitions of 4. One scheme, known 
as minimal subtraction, eliminates only the 1/e-terms by choice of Z3. For 
our purpose it is easier to get rid of all the constant factors by a suitable 
choice of Z3. This corresponds to 


5C2 —2N. 3le?Cz  e7Nylog2 
* » ete ia Oh (I ie peed fe fe 
it re = (2 7 — log tn) + 14472 24m 
Be? ‘i e aa k?u(1—u) +m? 
V(k)=1+ 1372 Saree? / 2) _ a, du u(1 — u) log —— 
(10.147) 


For zero-mass fermions, which is what we shall consider for the rest of our 
discussion, we can take 73 as the expression given above minus 5e?.N¥/ 727?; 
we absorb the integral of u(1 — u) log u(1 — u) also into Z3. Then 


2 
V € C N kj? 2 0 


From the calculation given above, the corrected gauge-particle propagator is 
of the form 
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ii 
k2 V(k) 


Among other things, this shows that the quantity # which was introduced 
for dimensional reasons can now be interpreted in a more physical way. It is 
the value of momentum transfer k at which V(k) = 1 and the propagator is 
just 1/7 mnie. 

The interaction between fermions due to the gauge-particle exchange is 
given by 


G(k) = (10.149) 


e? Pfc al: f 
r= 5 f ateyua(e) at? yatu) G2.) (10.150) 


In the case of QED, we interpreted the modified interaction as an effective 
increase of the strength of interaction with k?, or as a k-dependent effective 
charge. In the present case, such an interpretation is possible, but the situa- 
tion is not quite so simple. The reason is that there are other diagrams which 
contribute to the interaction between fermions, such as vertex corrections. 
For QED, these other corrections are of the form shown in figure 10.2. 


Fig 10.2. Corrections to fermion-fermion interaction by gauge boson 
exchange 


The WT identity connects the vertex correction and fermion self-energy and 
in effect their contributions cancel out; they do not contribute to the effec- 
tive charge. This is true in the nonabelian case as well. But in the nonabelian 
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theory there is another contribution given by the diagram 10.3, shown below. 
There is a part of the mathematical expression for this diagram which is sim- 
ilar to the propagator contribution and it must be included in the definition 
of the effective charge. The expression for this diagram is given by 


Fig 10.3. The diagram for the pinching contribution 


Lr = et for / G(x)t?y,4(x) a(y)t’t°F,.(y, z)a(z) 


etk(z— y) tip’ (z— Pe zy a” p! 1 
igac) = — A (y-k-y-p')S(y, 2) Yu 


(21)" p?(p! + k)? 


— WS(y, z)(27-k+4-p') + 2p), + ku)yaS(y, or] (10.151) 


The integral has an interesting feature. By writing —y - p’S(y, zje PY = 
~iy- A(e~” ”)S(y, z), we can do a partial integration over y. This will pro- 
duce a term iy -0OS(y, z) which is 7 times a delta function, 16(”) (y — z). Such 
terms are called pinching terms, since the points y, z become identical, which 
can be visualized as the fermion propagator line being pinched off the di- 
agram. This pinching contribution is what we are interested in. The other 
terms, including derivatives with respect to y from the partial integration, 
involve either powers of p or k and do not lead to a current-current inter- 
action structure. Therefore, even though they can be important, they are 
not relevant for the effective charge. After some algebraic simplifications, the 
pinching contribution due to (10.151) is seen to be 


2 . : 
f= Sf aera aut ua) | = etk(z—y) F(k) 


~ ae i] 
2 (42) 222 = dp 
F(k) = (wu ) (Qn) (py! +k)? 
e2C2 ps 2 
= gee ST y+2+ ing Ar — log(k | ) (10.152) 


214 10 Gauge Theories 


The 1/e-terms and the constant terms can be canceled by choice of the vertex 
renormalization in this case. By combining this result with the propagator 
contribution in (10.149, 10.150), we get the effective .current-current interac- 
tion as 


1 20 
Tepf = 5 / gt?yg(x) Gtoyaty) Vers (ty) 
2 2 awl eC 
Ver (k) = ] (: i Fags (5C2 = 2N;)log(/2)) a 2 Jog(k?/2”) 
. ~1 
1 
R5 t+ a5 + oer Galen 2N;) lok? /1) (10.153) 


We now define the nonabelian analog of the fine structure constant as a = 
e?/4m. We can then write 


at 


Vers (k) = 4a (10.154) 


where the effective coupling constant which is a function of k? is identified as 


_ acc 
o(k) = TiaaeainiiontRe rae 
7 Bliss (10.155) 


The effective charge in QED was seen to increase with the momentum 
transfer k. In the present case, we see that a actually decreases with increas- 
ing k so long as the number of species of fermions is not too large, so that 
one has 11C2 > 2Ny. This means that at high energies the theory asymp- 
totically approaches a free theory; this property is called asymptotic freedom 
in the terminology of the renormlization group introduced in Chapter 9. The 
discovery of asymptotic freedom in 1973 was the crucial step in using non- 
abelian gauge theories to construct a theory of strong interactions. Quantum 
chromodynamics (QCD), which describes strong interactions, is an SU(3) 
gauge theory coupled to quarks which are in the fundamental representation 
of SU(3), namely, triplets. In this case, C2 = 3, and asymptotic freedom is 
possible if we have fewer than 16 species of quarks. There are six species 
known to date and there are reasons to believe that there are no more than 
six species. Asymptotic freedom is observed to hold for energies above a few 
GeV. At such energies, the quark mass terms in the above formulae cannot 
be neglected, the masses suppress their contributions in loop integrals. As a 
result very high mass quarks do not contribute to effective charge and so, in 
any case, the possible existence of very high mass quarks, beyond those we 
know, will not affect asymptotic freedom at presently observed energies. 

The effective charge may be regarded as the running constant obtained by 
solving the renormalization group equation. Since we have done a calculation 
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of the effective charge directly, we can use this to obtain the @-function for a 
nonabelian gauge theory, to one-loop order, as 


e? 


Ble) = ~Fe72 | 


The G-function is negative, as expected for asymptotic freedom. Here we have 
obtained the 3-function after calculating the effective charge. Going back to 
the action (10.125) and writing it in terms of bare fields, we see that the 


bare charge ep is related to the renormalized charge e as e9 = 2123 3/2 This 
shows that we can calculate the 6-function, by evaluating Z,, and using this 
formula and the definition (9.121); we have already obtained Z3. This is the 
more conventional way of calculating the 6-function in a nonabelian theory. 

The behavior of QCD at high energies may be understood by solving the 
RG equation for the Green’s functions, using the 3-function (10.156). One 
can use perturbation theory to compute the required anomalous dimensions 
by virtue of asymptotic freedom. Applications of perturbative QCD to high 
energy scattering processes, which is a vast subject in its own right, are based 
on this property. 

The formula for the effective charge shows that we can define a finite, 
dimensionful parameter Agcp by 


dents + blog Agen =0 (10.157) 
a(1) pe 


In terms of Agcp, the effective coupling may be written as 


1 
nA —————— 
= Fog Myon) 
127 


= ae 10.158 
(110 — 2Ny) log(k#/A2,cp) ae 


11C2 — 2N¥) (10.156) 


We see that the coupling constant is entirely determined by Agcp. The 
dimensionless coupling a(j) had been traded for a dimensionful parameter 
Agcp. This is known as dimensional transmutation. Agcp determines the 
relevance of various kinematic regimes and how they have to be analyzed. 
Modes of the field with values of & much larger than Agcp are weakly coupled 
and one can use perturbative analysis; modes with k comparable to, or less 
than, Agop, have to be treated nonperturbatively. Agcp is thus the basic 
scale parameter of the theory. The theory does not choose any value for 
Agcp; it is an input parameter for the theory. The fact that we have this 
freedom is a residue of the classical scale invariance of the theory. 
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11 Symmetry 


11.1 Realizations of symmetry 


We will begin with the discussion of continuous symmetries. In Chapter 3, 
we saw that, if a Lagrangian has a continuous global symmetry, then there 
is a conserved current mie and a conserved charge Q4 associated with it. 
Here A takes the values 1,2,---N, where N is the number of continuous 
symmetries. The charge Q4 is conserved, which implies that it commutes with 
the Hamiltonian; i.e., [Q4, H] = 0. This also shows that (Q4,Q?] commutes 
with H and so leads to new conserved charges. If we have already included 
all the conserved charges, the commutator [Q4,Q®] must be a function of 
the Q’s themselves. 

We begin with the case of the Q’s generating an internal symmetry. Let |) 
be an eigenstate of H with eigenvalue Ey, H|a) = E,|a). Since [H, Q4] = 0, 
H (Q4\a)) = E. (Q4\a)). Thus the action of Q4 on a state gives another 
state which is degenerate with it. By applying Q’s on |a) many times we get a 
sequence of states which are degenerate with it. This process can be continued 
until no new states are generated and further applications of Q’s only produce 
linear combinations of states already included. This leads to a set of states 
generated in this way from |a@) and which is closed under the action of Q’s. 
Let Ha denote the subspace of the Hilbert space corresponding to this, and 
let {|a;)} be an orthonormal basis for this subspace. Then Q4|a;) € Ha 
for |a;) € Ho. Thus we get a matrix representation of Q4 on the subspace 
Ha by (Q4)i; = (ai|Q4]a;). This forms a representation of the algebra of 
the Q’s under matrix multiplication since the Q’s do not connect any state 
in Ha to any state in the complement of H,. This representation is also 
irreducible by construction since any state in Ha is connected to the others 
by suitable Q-actions. Since the Q’s are also realized as hermitian operators, 
this is a unitary irreducible representation (UIR). By carrying out a similar 
procedure with the other eigenstates of H, we see that the states of the system 
can be grouped into UIR’s of the algebra of the symmetry operators. 

This result holds for discrete symmetries, such as parity, as well. When we 
include continuous spacetime symmetries such as the Poincaré transforma- 
tions, the Hamiltonian H does not commute with them, but becomes part of 
the symmetry algebra. In this case, the states again form UIR’s of the sym- 
metry algebra, including spacetime symmetries. The states within each UIR 
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are degenerate, not for eigenvalues of the Hamiltonian but for eigenvalues of 
some invariant operators. (The representations are also infinite dimensional 
when spacetime symmetry, which is a noncompact symmetry, is included.) 
Our discussion of relativistic invariance in the appendix is an example of this, 
where the representations are labeled by mass and spin. 

This nice result can be vitiated if Q4|a) is not. nermalizable and hence 
does not belong to the Hilbert space H. In this case, we do not have a unitary 
representation of the symmetry and it is said to be spontaneously broken. ‘To 
understand this structure better, we start with the fact that, in field theory, 
the states are generated by the application of various local operators on the 
ground state or the vacuum state |0). We can therefore write |a) = A,|0). 
From the commutation rules, we can evaluate [Q4, Aq] = B4. B4|0) will be 
some other local operator in the theory. Generally, the normalization of the 
states A,|0) and B4|0) will have to defined with some regularization; with 
this understood, we may take them to be normalizable. We now write 


Q4|a) = B2|0) + Aa Q*|0) ; (11.1) 


If the ground state has the symmetry, viz., if Q4|0) = 0, Q4|a) is normal- 
izable in the sense defined above, and it is clear that our argument of the 
previous paragraphs will go through. States can be grouped into UIR’s of the 
algebra of symmetries. The contrapositive of this statement is that if Q4|a) 
is not normalizable, the vacuum cannot be invariant. Thus, spontaneous sym- 
metry breaking corresponds to the situation when we have symmetry at the 
level of operator algebra, but the vacuum (or ground state) is not invariant. 
There is no unitary realization of the symmetry algebra. 

The discussion so far indicates that there can be two realizations of sym- 
metry in quantum field theory, which can be summarized as follows. If a 
Lagrangian has a continuous global symmetry given by a set of operators 
{Q4} forming an algebra A, we have two possibilities. 


1. The ground state is symmetric, Q4|0) = 0 and the states can be grouped 
into unitary irreducible representations of A. 

2. Q4|0) #0; in this case the symmetry is spontaneously broken and there 
is no unitary representation of the symmetry. 


The first case is often referred to as the Wigner realization of symmetry 
and the second as the Goldstone realization of symmetry. (Spontaneous 
breaking of discrete global symmetries is also possible. A field-theoretic way 
of implementing the spontaneous breaking of discrete symmetries will be clear 
from what follows, although we do not discuss them in detail.) 

In the real world, ignoring gravitational and cosmological effects, we have 
Poincaré symmetry. This is realized in the Wigner mode since the particles 
we observe fall into UIR’s of the Poincaré group. (This has been used to 
construct relativistic wave functions in the appendix.) For this to be possi- 
ble, the vacuum state must be Lorentz-invariant and this justifies the use of 
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Lorentz symmetry to eliminate the zero-point energy in Chapter 3. We dis- 
cuss Wigner realization of symmetries in this chapter; spontaneous symmetry 
breaking is taken up in the next chapter. 


11.2 Ward-Takahashi identities 


In quantum field theory, the quantities of interest are the Green’s functions. 
In the Euclidean formulation, these are generated by 


Z[J] =N fide) exp (-sia+ [ sav*) (iia) 


where {dy] is a suitable measure for functional integration over the fields yp“. 
If the theory has a certain symmetry, it manifests itself as relations among 
various Green’s functions. These relations, known as Ward-Takahashi (WT) 
identities, can be derived as follows. Since y is a variable of integration in 
(11.2) we can write 


[laeiexp (-stv' +/ J49'*) = [ae] exp (-siel +/ Jae") (11.3) 


We shall first consider continuous transformations. Let y(x) — y(x)+€(z, y) 
under the symmetry transformtion of interest, where £(z,y) is considered 
infinitesimal. The strategy is to choose the new variable of integration y’ 
in (11.3) as p + €. Under this transformation, the measure of integration 
transforms as 


(dyp’] = [dy] det M = [dy](1+ 67) (11.4) 
where 
A Hy; 
MAP (a, 3) = 647 5(2 —y) + 8) 
_ a f 5E4(2,9) 
6g = (Toes) oe 


(Tr denotes the trace over the indices A, B as well as a functional trace; in 
other words, we take a trace considering (A, z) and (B, y) as matrix indices.) 

We also have S(y’) = S(y + €) = S(y) + 5¢S. Using these in (11.3) and 
collecting together terms which are of the first order in €, we get 


eases fuse acre 0 (11.6) 


where the angular brackets denote the functional average 


(Or= N f (ae exp (-Sie + [ Jae’) O (11.7) 
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Equation (11.6) is the basic Ward-Takahashi identity. This can be trans- 
formed into an infinite set of relations among all the Green’s functions by 
expansion in powers of J4. We shall illustrate this by examples. 

Consider a theory of N scalar fields y*, a = 1,2,---,N, with the Eu- 
clidean action 


“eR SQ eG 


S= pas Fume + smh + A(p2y")? (11.8) 


This theory has invariance under the transformation 


ye = yg’? ty yp + wy? (11.9) 
where w® is antisymmetric in a,b and independent of z“. We may write 
we = wA(TA)%® where (T4)*, A = 1,2,---,5N(N — 1), are generators of 
O(N), the set of orthogonal transformations in N variables. Thus (11.8) has a 
global O(N) symmetry. The symmetry is global, as opposed to local, because 
the parameters are independent of x". 

We now consider the change of variables y?(z) — y’?(x) + w°?(z)y?(z) 
in the functional integral. (For the symmetry (11.9), the parameters w% 
are independent of spacetime. To obtain the Ward-Takahashi identities, we 
consider a change of variables with w2° which are functions of x.) In this 
case 


js= — f 5,w°(oa" 
= | o a(ero%" (11.10) 


For the change of variables y?(z) — y’(r) + w%(x)y?(zx), the matrix 
M(x, y) = (6% + w%(x))6 (x — y). The Jacobian is thus independent 
of y and since the determinant of an O(V)-transformation is 1, it is possible 
to regulate in such a way that 67 = 0. The general identity (11.6) becomes 


(—d,(y2O"y? — PO" pt) + J*y?—J*p)=0 = (11.11) 
Writing Z[J] =e], we have 


a | w .. ow Z| ew 


Bat | I(x) 5I%(a) * Oye OUT, . seer) 
éwW 


+Jal@) SF (6) os (a Car b) =0 


(11.12) 


By expanding W[J] in powers of Ja, this becomes a set of relations among 
the connected Green’s functions of the theory. 
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If we now take w®? to be independent of x”, then dS = 0, and we get 


oW éW 
d‘xr ees | — 
/ (sate 2) sei) 5 - la) sr | 0 (11.13) 
Written in terms of I'[y], using Ja = 6I°/dy%, this equation reads 
6r 6r 
4 a. _ @? Ss 
pes (2 558 ® 5a - 0 (11.14) 


In other words, the quantum effective action I’[®] has the same O(N)- 
symmetry as the classical action. 


11.3 Ward-Takahashi identities for electrodynamics 


We now discuss the WT identities related to gauge invariance in quantum 
electrodynamics. Calculations in QED are done by functional integration over 
the Euclidean action 


S2(A,0,v[ dtc |r, FH + 2(0- A? 


+Zo [pb (y-O+m— dm) p — iedy*bA,] 
(11.15) 


In general, we have Z, and Z, in the fermionic part of the action, but we 
have already seen in the last chapter that when we have a regulator which 
preserves the BRST invariance, we can set Z; = Za. We have assumed this 
in the above expression. 

This action does not have gauge-invariance due to the gauge-fixing term 
(0- A)?. Thus one can only expect BRST-invariance and the associated WT 
identities. Nevertheless, it is possible to obtain WT identities for gauge trans- 
formations using the action (11.15). Their derivation will be slightly more 
involved than in the case of BRST identities, but they have the simplicity 
that we do not need the full formalism of the BRST-transformations. 

We start with the functional integral 


Antes fl eine eae (-se(4.4.9) : / J-A+ m+ 50) 


z / dulA’,v',@') exp (-set4’, we) + / TAL a! + en) 
(11.16) 


where in the second step we have made a semantic change of variables, just 
renaming the variables of integration. We now take the new variables as 
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A'=A+0,0 
a! ans ef w 
p! =p ei (11.17) 


This is now substituted into the second expression for Z[J,~, 7] in (11.16) 
and taking @ to be small, we collect the terms which are of the first order in 
6. If the measure of integration is invariant under this set of transformations, 
we get 


(~a fa-aores [7° 00 + ie | (qv — on)e) =i (11.18) 


Introducing log Z = W and the generating functional for the 1PI vertices, 
namely, PA, , }] by 


WlJ,n, 7] = Flava] + [m+ int s-A 


6r 
b= a 
or a 6r 
0 Ba Se ae (11.19) 
we can rewrite equation (11.18) as 
or -OF or 
== 3\ 0 2 aA i Vee eg —— —. — 2 
[sre + [ 00-5 ico or + Fo 0 (11.20) 


(Here A,,, w,w are independent fields and not variables of integration as in 
(11.18).) Since I is a functional of the fields A, 7,7, its change under an 
infinitesimal gauge transformation (11.17) is given by 


oy 
: 7 sn on 
= [00-5 -ie6 |b Ev S| 
éP | leer ee 


Equation (11.20) can thus be written as 


56 [r - / (a: A)| =-() (11.22) 


If we define I* by 
ee 
P=" te / (0-A)? (p23) 


. 
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equation (11.22) shows that I* is gauge invariant. If we expand I or I* 
in terms of vertex functions, this translates into an infinite set of identities 
connecting various vertex functions. 

As an example, consider the two-point term in I* given as 


re) = f Aya" (e, 9) Arlo) (11.24) 


The invariance of this expression under infinitesimal gauge transformations, 
namely, equation (11.22), gives us immediately 


6,,V"" (x,y) =0 (11.25) 


Writing Viv(z,y) = (-O dw + 0,,0,)6 (2 — y) + Hwv(2 — y) and taking 
Fourier transforms, this is equivalent to 


k,IT¥” (k) =0 | (11.26) 


Lorentz invariance requires that IT, should only be a function of k?; along 
with the transversality condition (11.26), this gives 


Ui (eo —Kaky) Ue) (11.27) 


As another example, consider the terms involving #1) and >A; these 
can mix under gauge transformations, so we must consider them together. 
We write 


ree) — | Ha)F(e,2)H(e) ie f HOV, (e,y,2)6@)AMW) +--+ (11.28) 
The invariance of this under an infinitesimal gauge transformation gives 
ie [02-6 HOF.) + f OWE) Velev. 20) 
—e | (8(2) ~ 6(2)) BlaV. (av, 2)0()AMy) +--+ = 0 
(11.29) 


Setting the coefficient of the 7 term to zero, we get 


F(z, 2) (5 (z—y) —6O (x — )) i moe y,z) =0 (11.30) 


For the terms involving A“ in (11.29), we need to consider the variation of 
terms involving two A’s; so we cannot set the coefficient of this term, as it is 
written in (11.29), to zero. In (11.30) we introduce the Fourier transforms 
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Vila, ys 2) = / V,,(p, qe? Vetale—-v) 


Pq 
F(a, z)6(z—y) = F(p)e?@-) e-ta(2—») 
big 
F(a, 2)5 (2 -y) = / Pige™ mie” (11.31) 
Pq 


In the last two equations, we recover the usual forms if we make the replace- 
ments g - q+p, p > p+4q, respectively. Using (11.31), we can simplify 
(11.30) as ‘ 

F(p) — F(q) + 4(q—p)*Vu(p, q) = 0 (11.32) 


Expanding this equation for small values of q — p, we get the equation 


OF... 
~ Op# se iV,.(p, p) = 0 (11.33) 
Equations (11.32) and (11.33) are the forms in which the WT identities were 
originally obtained. Since F(p) = iy-p+m+JZ(p), and V,.(p,q) = y+ (p,q), 
we see that these equations give an identity relating the electron self-energy 
2(p) to the vertex correction I,,(p, q). 
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We now discuss some of the discrete symmetries of a gauge theory. These will 
include parity, charge conjugation, and time-reversal. When used in conjunc- 
tion with the WT identities expressing the gauge-invariance of the effective 
action, these can be very powerful in restricting the form of the action (and 
the effective action) and can lead to low-energy theorems which are of general 
validity, not necessarily restricted to perturbation theory. 

In the terms of functional integral representations, discrete symmetries 
can be understood as follows. Let ¢ denote the transform of the field vari- 
able y under the discrete symmetry like parity, charge conjugation or time- 
reversal. This means that the Green’s function 


G=N [ide] eS Fler) 9(02)---Glew) (11.34) 
will describe the time-evolution of the transformed particle states. If the 


action has the property that S(y) = S(¢), and the measure has thus property 
as well, then we can write 


G=N [lag] 9(e1)G(a2)---Gaw) 


=G =N fig eS) B(21) (x2) +-- O(a) 
G 


(11.35) 
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since the change y — ¢ is reduced to a semantic change of variables in the 
integral. We will now work out these transformations for parity, charge con- 
jugation, and time-reversal. Some of these symmetries require the Minkowski 
metric and so in this section we shall use the functional integral in Minkowski 
space. 


Parity 
Parity corresponds to 2 — —z. On Dirac spinors this is implemented by 
p(w, t) = y°x(—a, t) (11.36) 


where y(z, t) is the parity image or parity transform of w(x, t). On the gauge 
field, parity is given by 


Ao(a, t) = Ao(—2,, t) 
A;(z, t) = —A;(—a, t) - (11.37) 


where the tildes denote parity transforms. 
We can check the parity invariance of QED by checking the various terms 
involved. As an example, consider fy - Oy. Writing < = (—a, t), we find 


il d‘x wiry - Oy = ‘i d*a x! (&)iy"Oyy°x(Z) 
= pas x! (%)i(Oo — y°y'0;)x(Z) 
= f ac x@)in sexe) 
Ore 
= fae (xin O.0 ee (11.38) 
(The change of limits compensates for the replacement of x by £ in the 
measure of integration.) Equation (11.38) shows the parity invariance of the 


kinetic term of the Dirac Lagrangian. 
For the standard bilinear covariants of the Dirac theory, we have the parity 


property 
py(x) = Xx(z) 


pyp(z) = —Xy°x(Z) (11.39) 
py b(z) = X°x(Z) 
py(a) = —Xy'x(2) (11.40) 


py yy(2) = —x7°y°x() 
pyiy?d(x) = x7 x(2) (11.41) 
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¥ f°, 7'] ¥(a) = -* [7,7] x@) 
v ly, 77] v(x) = x [, 7’] xt) (11.42) 
Thus wy) is a scalar, wy>w is a pseudoscalar, wy" is a vector, wytyp is an 


axial vector, and #[y",y"]w is a rank 2 tensor. 
The definition of parity (11.36) shows also that..-, -. 


(1+ 7°)o(a,t) = 7° 3(1 ¥ 7°)x(-2, t) (11.43) 


Thus the chiral projections p, = 4(1+7°)p and wr = 3(1 — 7°)w are 
transformed into each other under parity. A theory which has only one type 
of chiral fermions or a theory in which there are gauge fields which couple only 
to one type of chiral projection (e.g., A,wy"(1+7°)wv ) will necessarily violate 
parity invariance. The standard model of weak interactions is an example of 
such a theory. 


Charge conjugation 


This corresponds to the exchange of charges, for example, e* < e7, in 
any process. It is given by 


H(z) = CX" (2) 
A, (2) = —A,(z) (11.44) 
where C’ is the charge conjugation matrix defined by 
Col yO = —yHt? (11.45) 
Recall that the y-matrices are defined by the algebra 
he tae are (11.46) 


By taking the transpose of this equation, we see that y“7 should also obey 
the same algebra. Therefore, by the general theorem on the Clifford algebra 
mentioned in Chapter 1, we see that there should exist a matrix C with 
the property (11.45). The specific form of the matrix C will depend on the 
representation chosen for the yy. For the choice 


mes ee 8) (oe Ola w* 


C = i747’, c'1=-c=c! (11.48) 

In this case, we can also write (xr) = Cy°x*(x) = —iy?x*(x) as the trans- 
formation of the fermion field. 

Based on the rules given above, the C-invariance of any term in a La- 


grangian can be easily checked. For example, consider the electron kinetic 
term in QED. 


we have 
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Wy: (0 —tieA)p = x77°Cty?+# (4, + ieAy)Cy?x* 
= yee, + teAy) x" 
= dan! yy x= teduxl atx 
= —O,X7'x — eA, Xy"x (11.49) 


where we have used (11.45). Notice also that in the third step we get an extra 
minus sign in rearranging y7 and y* because they are Grassmann-valued. In 
the action, we can do a partial integration to get 


sei / Poet oe era 
= [es i |-a.er"x — ied, x7"x| 
= pas i [ey“dux ~ icA Ky" x] 
= S(A, x, x) (11.50) 


The C-invariance of the other terms in the Lagrangian can be checked in a 
similar way. For the standard bilinear covariants we have 


Wp = XX 
pyr = xx 
pytp = —xr"x 
Pye = xox 
wy, Wb = xiv", 71x (11.51) 


Notice that C-1y5C = 7°? using 7° = i7°y1y74°. 


When there are nonabelian gauge fields, the notion of charge conjugation 
involves conjugation in the Lie algebra as well. Consider, for example, the 
term wyApy, where w transforms as some representation of the gauge group. 
w is a column vector on which the Lie algebra matrices t® can act as a linear 
transformation. A, = —it*A/, and t® are the generators of the Lie algebra in 
the representation corresponding to w. Using the charge conjugation property 
(11.44), we have 


i Arpyhtep = —iAe xT ClyyHt2Cy?x* 
a a9 op ac he fla alae a 


= 1 AG ytOyheet y (11.52) 
This term will have invariance if we define the charge conjugation prop- 
erty of the gauge field as t*Ay, = ct )Ay in this case, —1Appy't?y = 


—iAryytt?y. This definition of charge conjugation corresponds exactly to 
conjugation in the Lie algebra. Writing the Lie algebra commutation rules as 
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(7, >] = ifet’, we see that if t® form a matrix representation, then so do 
—t°T The transformation t¢ + —t°7 is an automorphism of the Lie algebra. 
In terms of representations, this corresponds to replacing a given representa- 
tion by its conjugate. Thus Afit® = Aa (—t27) is indeed the correct definition 
of charge conjugation. 


Time — reversal 


Time-reversal transformation is defined by (a,t) — (x,—t) = z. The 
time-reversal transformation of fields is given by 


p(w, t) = 7°Cx* (x, -t) 
Ao (2, t) = Ao(@, —t) 
Aj( a) = —A;(x, —t) : (11.53) 
Time-reversal is an antiunitary transformation. In terms of operators, this 
means that if A, B denote the transforms of operators A and B, then AB = 
BA. 
Consider now the change of various terms in the action in the functional 
integral for QED under (11.53) viewed as a change of variables. It is easily 
seen that the bosonic terms are invariant, i.e., S(A) = S(A). For the kinetic 


term of the Dirac action, by rearrangement of fields and a partial integration, 
we find 


/ d*z iy -(0—ieA)p =i if d*z ptotvrtactox 
~iogF CMP FOX Ay 
=j | d+z [xt 7 Oux + text yyy Ay] 
= i d*az [X(-y°Oo — °:)x + tex(7" Ao — 7*A:)x] 
=i | de |\—( ae) Pees (Z) 
Xt aeaX } + texy"Ay(Z)x 
|| ner Ot ae s 
= i d*z lee) iy" (ser - icA,(@)) x(@| 
= - fds xX iy? (Ou — ieA,,)x (11.54) 


In the last step, we renamed the variable of integration < as x. We get the 
same term with the time-reversed fields substituted in, but there is an extra 
minus sign. A similar result holds for the mass term in QED, so that for the 
whole fermionic part of the action we get 


S(A,,$) =-S(A,x,X) (11.55) 
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This change of sign is related to the antiunitary nature of the time-reversal 
transformation. At the level of the S-matrix, this corresponds, as should 
be expected, to the replacement of processes by time-reversed processes. At 
the level of the generating functional Z, invariance of QED follows from the 
fact that the determinant obtained by integration over fermions, namely, 
det (iy - (0 — ieA) — m), is also equal to det[—(iy - (0 — ieA) — m)}. 


Furry’s theorem 


The invariance of QED under charge conjugation leads to the result that 
all Feynman diagrams with only an odd number of external photon lines 
vanish. This is known as Furry’s theorem. It is easily obtained as follows. We 
carry out the change of variables 7 — Cy°x* in the functional integral. We 
do not make any change for the electromagnetic potential A,. From (11.50), 
we have S(A, wv, ¥) = S(A, x, ¥) = S(—A, x, X) for the fermionic action. The 
bosonic part is even in A and so we have indeed 


S(A,v, b) = S(-A,x, X) (11.56) 


for the whole theory. We then find 
Z( J] = [laAavad SA DY of JPA, 
= [laAarax) eS(-AXD oS JH Ay 


= ‘i [dAdxdy] e(Ax%)@~ J I*Au 
= Z[-J] (11.57) 


This shows immediately that all diagrams with only an odd number of photon 
external lines will vanish. (If there are external fermion lines, the result is 
different since there will be changes for the fermion sources.) 


CPTtheorem 


While QED has the discrete symmetries of C', P and T’, the standard 
model of particle interactions violates parity invariance and, to a small ex- 
tent, T invariance. However, if we have a Lorentz-invariant theory with an 
interaction Hamiltonian which is the integral of a local hermitian density and 
if the fields are quantized with the proper spin-statistics connection, then, it 
can be proved that CPT, the combined operation of all three, is always a 
symmetry. This is the celebrated CPT theorem. The product can be taken 
in any order. 


232 11 Symmetry 


11.5 Low-energy theorem for Compton scattering 


The WT idenitity for gauge-invariance (11.22) tells.us that '* for QED is 
gauge-invariant. By combining this with the discrete symmetries, it is possible 
to derive general low-energy theorems for certain scattering processes. Here 
we illustrate how this can be done for Compton. scattering. We take the 
incoming fermion momentum to be p, the outgoing fermion momentum to be 
p'; k, k’ will denote the incoming and outgoing photon momenta, respectively. 
We want to consider the forward scattering amplitude for which p = p’ and 
k = k’. For the forward scattering amplitude, the only kinematic invariant is 
w = p-k/m, and we want to calculate the scattering amplitude to the linear 
order in w. : 

As we discussed in Chapter 8, I" is the quantum effective action, quantum 
effects are already included in it, and so, we can do classical scattering theory 
using I" as the action to obtain the full quantum S-matrix of the theory. In 
other words, the S-matrix can be constructed from I" by considering only the 
tree diagrams. Consider an expansion of the J* in powers of the photon field 
A,. For Compton scattering, with one incoming and one outgoing photon, 
we need the terms with one and two powers of A, only. (Higher powers of A 
can be eliminated since we should not form photon loops when using I’.) We 
are also interested in low energy scattering, so we can consider various vertex 
functions, as in (11.28), expanded in powers of the momenta or derivatives 
of fields. For example, the two-point function for the fermions is of the form 


r3,= [ Be) Ke.vv) (11.58) 


The equation K7 = 0 must have as solutions the usual plane wave solutions 
of the Dirac equation. This tells us that, in momentum space, K must have 
the form K(p) = (y- p—m)h(s) where h(s) is some function which can be 
expanded in powers of s = (p*—m?)/m? and h(0) = 1 to ensure that the fields 
are normalized properly. (If we consider g as a function of p* rather than s, 
then we need h(p? = m?) = 1. One can rearrange the series to write it in terms 
of s.) In the interacting theory, WT identities connect this two-point function 
to the photon vertices. In coordinate-space, momenta are derivatives and 
terms with derivatives are not gauge invariant. The photon vertices are such 
that they combine with the two-point function appropriately to ensure gauge- 
invariance. Since we know that derivatives must become covariant derivatives, 
0 — D=0-ieA, for reasons of gauge invariance, it is easy to write down 
the general form of the function. We need 


Ty = 5 [8 (5) (iy Dm) + (iy D—m)W(SY (11.59) 


Here m?S = —D? — m?. Notice that S' does not commute with 7 - D, so the 
ordering of h and y-D is important. The symmetric ordering is consistent with 
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C-invariance and the fact that we have free photons in Compton scattering. 
Notice that the relevant commutator is 


bliy- D, —D? |p = —2epy" Fur Did — ery"b(Dy Fv) (11.60) 


The second term is irrelevant for Compton scattering which does not need 
photon propagators and so the photon field can be taken to obey the equation 
of motion. The first term is ruled out by C-invariance. This justifies the 
symmetric ordering in (11.59). 

In addition to the two-point function given above, we can have terms like 


— | eg VSR ay + oF ay PO + ee, 


The term with the two F’s is also nonlocal in general. In terms of photon 
momenta k, it is already of order k?, so higher derivatives are irrelevant at 
low energies; even this term can be neglected to the order we are calculating. 
Similarly, for the first term in (11.61) we only need f(0), the higher terms 
will be negligible for k small compared to the mass of the fermion. f(0) is 
related to the anomalous magnetic moment of the particle and, in fact, by 
comparing this term with the nonrelativistic limit of the Dirac theory, we can 
identify f(0) = 

For low-energy Compton scattering, we only need h(S) + 1+h1S = 
1+h,(m?s—eA-p—ep-A+e?A?)/m?. The relevant terms are then obtained 
as 


rear 4 r® 4 p@Q) 4 pA) 4 p®@ 
PO = $(iy-8—m)h(s)p 


PO = ei [y- Ah(s) + h(s)7- Aly 
PO) = (9-2) WE 
re) = ong Bl (er @—m)(A-P+P-A) 


4(A-P+P-A)iy-0—m)|¢ 


r?) = hj p (iy: —m)A? + A? (iy-0—m)] 
ie 
1 ccaseonencmiomyeay 


(11.62) 


In this expression P = i0, and we have already dropped the F?-term from 
(11.61) for reasons cited. 

Consider the term I") with two A’s first. Here w and w will be replaced 
by free-particle wave functions which obey the equations of motion. So the 
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first term in I?) does not contribute to Compton scattering. The second 
term will involve (2p + k)-e or (2p+ k)-e’. It is possible to choose a gauge 
where p-e = 0 and at low energies we get p- e’ ~ 0. (This is evident if we 
go to the rest frame of the initial fermion.) All of '@) is not relevant to the 
order in photon momentum that we are calculating. For similar reasons, the 
term I”) (with (A-P+P.-A)) can be seen to be negligible. The only terms 
which are important to the order we are interested in are 7, FP) and 
Ir@). In computing the amplitude using these terms, we encounter h(s) for 
s = ((p+k)?—m?)/m? = 2p-k/m? = 2w/m and for s = ((p—k)? —m?)/m? = 
—2Qw/m. 

The probability amplitude te to ['), taken to second order, is given by 


eine: a 
ae = fr : ‘(1+hZ) (y- (P+ hk) — m)(1 + 2hyw/m) "** 


z 


a (1-2) ———— Ve 
4 ‘m) (y-(p—k) —m)(1—2hyw/m) "© |" 
let 
~~) —ie purty ey ky e+ y-ey-by-e] tp 
~ ier e’ : 11.63 
Rim (11.63) 
where we have used the fact that, because we chose p- e = 0, 
(y-p+m) 7: etp = —7-e(y-p+ m)up = 0 (11.64) 


and the similar result for e’. 
The amplitude which involves the product of P) and P@ is zero to this 
order. The contribution due to 7) taken to second order gives 


2)? i 
M®) = € {e is “Ike! a .~B ™ 
64m2 Up ly 9 a aera a Y Ik €g 
a 
BAY b - e) fs) y 
sa ry ] pe are RS rs. ra |kae€g Up 
(11.65) 


By rationalizing the propagator and rearranging the terms in the numerator 
and using (y- k)? = 0, we can simplify this to 


Ag 


lot alte hueyer (11.66) 


Te 


We have also dropped some more terms which are negligible for small k. Since 
Y + PUp = MUpy WE May write 
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sz = 
tpl 1 Tup = 5 tp ly yl’. a) + 74[y yy plup (11.67) 


Simplifying the y-matrix algebra, we find 


= — = pH vy a 2p 2p” a 
Upy"[y", y"]up = tip — les ar ayhey? — _ tbe (11.68) 
Substituting this in (11.66), we get 
a’) & eee a 
M = Bm2 9 ao 2) Ww lipo’ up(e x e)s (11.69) 


The forward scattering amplitude can be parametrized in general as 
M = itip [fi(w) e-e! + iwfe(w) (e’ x e): o*] up (11.70) 


where fi;(w) and f2(w) are the (photon) helicity-preserving and _helicity- 
flipping amplitudes. Combining our results (11.63) and (11.69), we get the 
total scattering amplitude in the forward direction at low energies as 


2 2 
M * itip |-Se -e iws (9 — 2)?(e’ x e)ia*| up (11.71) 
In other words, 
2 
e 
Oe 
e” a 

f2(0) = a2 9 = (11.72) 


The low-energy forward Compton scattering is entirely determined by the 
charge and magnetic moment of the fermion on which the photon scatters. 
This result is very general. We have not made any assumptions of perturba- 
tion theory, nor have we assumed that there are only electromagnetic inter- 
actions. We have only used the invariance properties of QED coupled with a 
low-energy expansion. This result can hold for even composite particles like 
the proton. 

Direct verification of (11.72) for the proton, for example, is difficult; usu- 
ally it is converted to a sum rule on cross sections for scattering of polarized 
photons on the target fermion by using dispersion relations. This sum rule is 
known as the Drell-Hearn sum rule. It is in good agreement with experimental 
data. 
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12 Spontaneous symmetry breaking 


12.1 Spontaneous breaking of a continuous global 
symmetry 


In this chapter we give a more detailed description of spontaneous symmetry 
breaking or the Goldstone realization of symmetry, focusing on continuous 
global symmetries at first. The characterization of the ground state or vacuum 
state by Q“|0) 4 0, for a spontaneously broken symmetry generated by Q4, 
is not very convenient for calculational purposes. Therefore we shall rewrite 
this in somewhat different ways. Let A, be a set of operators which transform 
nontrivially under the continuous symmetry Lie group G. For an infinitesimal 
transformation, we have 6A, = (t2,04) Ag for some group parameters 64 
and an are the generators of the Lie algebra of G in the matrix representation 
to which Ag belong. Since Q4 generate group transformations in the quantum 
theory, this means that we can write 


i6 Aq = (Aa, Q- 6] 
= i(t4,04) Ag (12.1) 


The finite version of this relation is 
eet Ag a = Daeit) Ag (12:2) 


where D,,(6) is the matrix representing the transformation corresponding 
to the parameters 64 in the representation to which Ag belong. Taking the 
vacuum (ground state) expectation value of this equation, we obtain 


(Ole"@? Ag, e~*9'°|0) = Daa(@) (0|Aal0) (12.3) 


From this equation we see that, if Q4|0) = 0, (0|Aa|0) = Dag(@) (0|A,|0). 
Since D.g(9) is not the identity matrix for all 0, because A, is taken to 
transform nontrivially, this means that (0|Aq|0) must be zero, giving us the 


statement 
Q“\0) =0 => (0|A.|0) = 0 (12.4) 


for all A, which are not invariant under G. The contrapositive of this state- 
ment is that if an operator which is not invariant under G develops a nonzero 
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vacuum (ground state) expectation value , then the symmetry G is sponta- 
neously broken. This gives us a way of implementing spontaneous symmetry 
breaking by assigning vacuum expectation values to operators which are not 
invariant under G. Of course, we cannot assign expectation values arbitrar- 
ily. The ground state is obtained by minimizing the Hamiltonian. So what we 
mean is that, in a theory with spontaneous symmetry breaking, the Hamil- 
tonian is such that its minimization leads to nonzero expectation values for 
certain operators which transform nontrivially under G. 


Semiclassical construction of the ground state 


The construction of a ground state with the property of spontaneous 
symmetry breaking can be explicitly carried out if a semiclassical approach is 
valid for the theory under consideration. We illustrate this by considering the 
example of spontaneous breaking of a U(1)-symmetry. Consider the theory 
of a complex scalar field y with the action 


S= / d‘z [d,y"04p — a y*p —b (y*y)?] (12.5) 


This has an obvious U(1) symmetry, the transformations on the field being 
y — y’ = ey. The Hamiltonian corresponding to (12.5) is given by 


a | dx [dop*doy + Vo"Vy +0 ye +b (y*9)] (12.6) 


Classically it is easy to choose the ground state. We must minimize the Hamil- 
tonian. The field configuration (y,0oy) which minimizes H is the ground 
state configuration. This configuration will depend on the values of the pa- 
rameters a, b. If b is negative, H is minimized by taking O9y = 0, y -> 00, in 
which case H — —oo. There is no ground state; this is an unphysical case. 
We must therefore require that b > 0. Then there are two cases of interest. 
1) a > 0. In this case every term in H is positive and H is minimized by the 
configuration Oop = 0, y = 0. 
2) a < 0. In this case, we rewrite the Hamiltonian as 
2 

AS pes |a0p*doy + Veo"Ve + b (y*p — v2)") -- [es ae (12.7) 
where v* = |a|/2b. The configuration which minimizes H is evidently given 
by 


Ooy = 0, Vy =0 
g*p=vu* or yp=ve® (12.8) 
(a is independent of x since 0,9 = 0.) 
The classical values can be realized as expectation values in the quan- 


tum theory. Denoting the operator corresponding to y as ¢, notice that the 
expectation value of a product like ¢*¢ is given by 
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(0|6" 910) = (0|d*|0) (0|4|0) + O(h) (12.9) 


Thus, at least semiclassically, we can specify the properties of the ground 
state by writing (0|¢|0) = v where v is obtained by minimizing the classical 
Hamiltonian. The two possibilities are thus 


1. a>0 ' —" (O\g|0) = 0 
2.a<0 (0|¢|0) = ./la|/2b e** + O(A). 


Since ¢ — ¢’ = ed under the U(1) transformation, the second case where 
@ has an expectation value in the ground state corresponds to spontaneous 
breaking of U(1) symmetry. 

One can construct the wave function of the ground state (vacuum state) 
explicitly as follows. Consider the first case where (0|¢|0) = 0. We first write 
& = (¢1 + ig2)/ V2, where ¢1, ¢2 are real fields. We then introduce the mode 
expansions 


¢i(z) = D> aixux(a) (12.10) 
k 


for i = 1,2 and where u;(z) is a complete set of real functions for x within a 
box of volume V = L?. We can take u,(x) to be eigenfunctions of —V? with 
eigenvalues k*. Explicitly, if we impose the boundary condition ¢ = 0 on OV, 
we may take 


3 
ape) = (Z) — (123) 
with 
[ee up (a)u(x) = dnt (12.12) 
In a similar way we can write 
5obi = >_ pikur(a) (12.18) 
k 


The canonical commutation rules give 
[dik Pit] = 243 One (12.14) 
and the Hamiltonian can be written as 
iL 
—s Do (pie + we Ge) + Hint (12.15) 
k,i 


w? = k? +a. If we ignore the interaction Hamiltonian Hin with the idea of 
including it perturbatively, then the Hamiltonian is that of the harmonic os- 
cillator form and the ground state wave function can be immediately written 
down. It is given by 


240 12 Spontaneous symmetry breaking 


1 Af 
Yo = (g|0) = N | [exp (- sn = N exp 5 So wr (1246) 
i,k i,k 


(NV is a normalization factor.) In this case, evidently, 


(0|b:|0) = S— (Olgix|O) wx (ar) 0° (12.17) 


k 


The semiclassical construction of the vacuum wave function can be done 
along similar lines for the case.of spontaneous symmetry breaking. In order 
to achieve (0|¢|0) = v e*~, we write 


d= ef |v + a(n +im)| | (12.18) 


and treat m1, 72 as ordinary quantum fields with (0|7;|0) = 0. Substituting 
this into the action (12.5), we find 


os i 5 [(Om)? + (Onz)? — 2\alni] — Valo m (ni +02) — “(nh +3)? (12.19) 


up to the additive constant |a|?/4b. The term linear in 7 is zero since v was 
chosen to be the classical minimum of the action, v2|a|/2b. Treating the cubic 
and higher terms perturbatively, we can construct the vacuum state in terms 
of 7, and 72. 7 is a massive field, of mass ,/2|a|. n2 is massless and it is 
referred to as the Goldstone boson. The appearance of a massless field is a 
very general feature of spontaneous breaking of continuous symmetries known 
as Goldstone’s theorem. Introducing a mode expansion 


m(z) = >> meux(a) 
k 

no(z) = >> nonun(a) (12.20) 
k 


the ground state wave function can be obtained as 


{ 1 
Wo = N exp (-4 S- Nxn?, aie 2 Sov (12073) 
k k 


where 2, = /k-k+ 2la|, we = Vk-k. With this wave function, it is clear 
that 


(0|4|0) = v e* (12.22) 


since (0|7|0) = 0, (0|n2|0) = 0, at least in the approximation of neglecting 
the interactions. (The interactions do involve the Goldstone mode and may 
vitiate this result in some cases.) From the point of view of separating out 
the Goldstone mode, a better parametrization for ¢ is 
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= € + ee) exp ( a + ia) (12.23) 


where p(x) and \(z) are the dynamical fields. Substituting this into the action 
(12.5) we get 


SS i f Op)” +(2 + ae (OA)? = al" — V2 vb p> ~ ; p* + constant 
= I 5 [(Op)? + (0A)? — 2\a|p?| + O(p*) + +constant (12.24) 


p is a massive field, of mass \/2|a| and A is the massless Goldstone boson. 
Notice that the potential term does not depend on the Goldstone mode X. 
Again, treating the cubic and higher terms perturbatively, we can construct 
the vacuum state in terms of p and X as 


= 1 2 1 2 
Wo = N exp (-4 » 2p}, — 5 Dt) (12:25) 
where the fields have the mode expansions 


x) = >) prux(«) 
x) = S> Agur(a) (12.26) 
k 


The calculation of the expectation value is now more involved because of 
the composite nature of the operator exp (iA vf vvV2). Introduce a renormalized 
operator by 


O = exp (i, S A) (12.27) 


where A is a renormalization constant to be fixed by canceling the possible 
ultraviolet divergence. We can evaluate the expectation value using the wave 
function (12.25). Since it involves (A(x)A(x)), we can also evaluate it in a 
simpler way from the equal-time value of the propagator, since A(x) is a 
free field. (There is some difference between these methods, since we used 
periodic boundary conditions for the propagator whereas, here, we have fields 
vanishing on the boundary. Since there is arbitrariness due tot he cut-off 
anyway, this difference is immaterial; the qualitative results are the same.) 
With a high momentum cut-off A, we then find 


(0|0|0) = exp (4 - a 


16722 
= (12.28) 
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where we choose the renormalization constant A to be A?/167?v?; this gives 
(0|¢|0) = ve’*, as expected from the minimization of the Hamiltonian, per- 
turbatively in the interactions of p. 

This calculation also shows that there would be exceptions in a theory 
in one spatial dimension. In this case the k-integration in the evaluation of 
(0|O|0) has infrared divergences as well and gives... 


it 
(0|O}0) = exp (4 = 16m 1og(A/in)) (12.29) 
where kin is a low-momentum or infrared cut-off. v is dimensionless in one 
spatial dimension and we have set it to 1 without loss of generality. We can 


renormalize O at some scale py, choosing A = ;g- log A/y. This leads to 


ae 167 
(0|O|0) = a 
aC (12.30) 


Thus the expectation value (0|¢|0) vanishes as the infrared cut-off goes to 
zero. This is an infrared effect and has nothing to do with our renormaliza- 
tion prescription. Large-scale (small k) fluctuations of the potential Goldstone 
field destabilize the ground-state expectation value. This problem exists for 
any massless field; in general, one cannot have spontaneous breaking of con- 
tinuous symmetries in one spatial dimension. This result is known as the 
Mermin-Wagner-Coleman theorem. 

The construction of the ground state is semiclassical in the sense that 
we choose the classical ground-state configuration ve'* and build a wave 
function that peaks around this value. This will be a good starting point if 
one can do an fi-expansion of the theory where the higher-order terms are less 
significant. In an h-expansion, we can include corrections to YW and (0|@|0). 
However, if an f-expansion is not suitable, there is no general rule for the 
construction of the ground-state wave function, even though it can still be 
indirectly characterized by the expectation values of various operators. 


12.2 Orthogonality of different ground states 


The ground state wave function we have constructed gives (0|¢|0) = ve’. 
Different values of a correspond to different ground states, so for this section, 
we will write |a) for the ground state which gives the expectation value ve’”. 
Classically we see that they all have the same energy and so we have a set 
of degenerate classical ground states. What is the value of the phase a that 
we should use for the quantum ground state? Does the true ground state 
correspond to a specific value of a or is it a superposition of |a) over different 
a’s? This is the natural question in using the states |a). We now show the 
following results. 
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1. (a|0) = 0. More generally (ala’) = 0 for a # a’. 
2. Physical results are independent of the value of a. 


These results show that one can build up the quantum theory by choosing 
one value of a; one must make a choice but it does not matter which value 
is chosen. 

The second statement is easy to see; the substitution of the field @ as 
in (12.23) led to the action (12.24) which is manifestly independent of a. 
Therefore all results obtained from this action will be independent of a. As 
for the first statement, notice that a change in a is equivalent to a shift of the 
Goldstone field A. The operator which generates this is 09. Thus, formally 
we may write 


la) = exp (~ivav3 f ax a») |0) (12.31) 


The expectation value of exp(iA/vV2) for this state is e**. We will use a 
cut-off on the spatial volume V = L° and define the integral of Op, as 


/ Bx Id = jj Bz en? /L? ay3| (12.32) 
L—-o0o 


The overlap of two states |0) and |a) is given by 


(Ola) = (0| exp (-ivav3 f 3.) |0) 
= exp (-v%a? [ @r(a)rw)) (12.33) 


Evaluating the expectation value in the exponent we find 


3 : 
J (oxox) = al os Badd y wp ete (@-¥) e—27/L?—-w?/L 


2 J 4a)? 
=i (12.34) 
2 
so that 
(Ola) = exp (-50"v?Z") (12.35) 


The overlap vanishes as exp(—V 3). This applies also for general a, a’ and 
further for (0|O|a) for any local operator O. The ground states are thus or- 
thogonal in the limit of large volumes and there are no transitions between 
them induced by local operators. (Notice that the result is obtained only 
for V — oo which will also correspond to infinite number of degrees of free- 
dom; for systems with finite number of degrees of freedom, truly there is no 
spontaneous symmetry breaking. In practice, large systems, e.g., macroscopic 
crystals, can be approximated by the infinite system.) Notice also that the 
vanishing of (0|q) for all a 4 0 shows that the symmetry cannot be unitarily 
implemented in the quantum theory. 
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In summary, we have found that the U(1) theory (12.5) can exist in two 
phases. For a > 0, one has the symmetric phase where the U(1) symmetry 
is manifest, realized in the Wigner mode. For a < 0, the U(1) symmetry is 
spontaneously broken and we have the Goldstone realization. 


12.3 Goldstone’s theorem 


We shall first consider Goldstone’s theorem classically using an O(N)- 
symmetric theory as an example. We consider the action 


S= i d*x Fras - a) (12.36) 


The O(N)-transformations are given by dyq = 04(T) aby», where (T4) ap are 
generators of the symmetry group G = O(N) given as N x N antisymmetric 
matrices. The kinetic term is obviously invariant under these. ‘The invariance 
of the potential term gives 


OV 
OYe 


(T*) .0ne= 0 . (12.37) 


Differentiating this once more, we find 


O°V 
pap 


OV 
(T)ache + Ben (Pog —0 (12.38) 


We evaluate this at the vacuum expectation value ~, = vq, which is the 
solution of (OV/Oy~q) = 0. Equation (12.38) then becomes 


O2V 
— T4) acVe = 0 12.39 
(saan ) p=vu ( ) ‘ ( ) 


The mass matrix which appears upon expanding S around the vacuum ex- 


pectation value is 
eV 
Ma = ( ) (12.40) 
y=u 


Opa0Vr 
Equation (12.39) can thus be written as 


Mares = 0 (12.41) 


where €4 = (TA), .t¢. 

The generators T“ can be divided into two classes. The generators which 
annihilate v° form the isotropy subgroup of v, (or little group of v,) denoted 
by H € G; these generators of H will be denoted by t®. The generators of 
G which correspond to the broken symmetries, i.e., those which are not in 
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the algebra of H, will be denoted by S*. For the H-generators €4 = 0 and 
we have no information about their masses from (12.41). For the generators 
which correspond to the broken symmetries, £4 = €° are in general not 
zero. We then see that they are eigenstates of the mass matrix with zero 
eigenvalue; the zero eigenvalues of the mass matrix correspond to massless 
particles. If all €’ are linearly independent, we have shown that for every 
broken generator there is a massless particle. This is the classical version of 
Goldstone’s theorem. The linear independence of the £*, which is needed to 
complete the proof of the theorem, can be seen as follows. If €* are linearly 
dependent, then there is a nonzero solution for c; with €'c; = 0. This means 
that the quantities S;; = &:aja, considered as the elements of a matrix, 
correspond to a matrix of zero determinant. Since S;; is symmetric, we can 
diagonalize it by an orthogonal transformation R;;. (This is not in O(N) 
but is an orthogonal transformation on the directions corresponding to the 
generators, in a subgroup of O[N(N — 1)/2].) The diagonal version of 5;; 
is Si; = CiaCja Where Gia = Rji€ja. The diagonal elements are manifestly 
positive and nonzero. If Gj, is zero, then we have a larger isotropy subgroup 
for the vacuum expectation value vg. Thus £° are linearly independent and 
Goldstone’s theorem follows. 

Transformations which leave the vacuum invariant form a subgroup H € 
G, which is called the invariance group of the vacuum. This is also the little 
group or isotropy group of the vacuum expectation value. Equation (12.3) 
tells us that (Oje*@? og e~*?°|0) = Das (8) (0|¢,|0), so that 


e210) = |0) => Das(O) (0|¢4|0) = (0|¢a|0) (12.42) 


The invariance group of the vacuum, H, is called the “unbroken” subgroup of 
the symmetry group G; it is the largest unitarily realized symmetry group of 
the theory. If there are several fields with corresponding vacuum expectation 
values, H is the largest intersection of the respective little groups. 

We now turn to Goldstone’s theorem in the quantum theory. Starting with 
a continuous global symmetry of the Lagrangian, by Noether’s theorem, there 
is a conserved current J", i.e., 


6,J4 =0 (12.43) 


Consider now an operator A(z) which transforms nontrivially under the sym- 
metry, or does not commute with the charge Q. 


[Q(t), A(0)] = B(O) (12.44) 


The strategy of the proof is to consider (0|B|0) which has to be nonzero if 
we have spontaneous symmetry breaking. From the conservation law (12.43), 
we get 


ie | dz (0, J" (2), A(0)] 
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= f asio'(2), A] + F100, AO) 
d 
= F1a(t), A(0) | (12.45) 


where we used the fact that ¢dS;J°* vanishes at spatial infinity if the fields 
vanish sufficiently fast or if we have periodic boundary conditions. We can 
thus conclude that 

(0|B|0) = (0| [Q(é), A(O)] 0) (12.46) 


is independent of t for any t, i.e., time-independent. Using 
J°(ax, t) = e *P Fett 790, O)e Pte? =. (12.47) 


and inserting a complete set of states, equation (12.45) becomes 
(2n)° a ERO (n| A(0)|0)e~*?"* 
Tr 


— (0|A(0)|n) (n|J°(0)|0)e**"* | = (0| BIO) 
(12.48) 


where E,,kp, are the energy and momentum of the state |n). The right- 
hand side of this equation is independent of time because of (12.46), and 
it is nonvanishing because we have the premise of spontaneous symmetry 
breaking. Since positive and negative frequency contributions cannot cancel 
out mutually, this equation can be satisfied only if we have a state |G) with 
Eq = 0 contributing on the left hand side. Because of the 5)(k,) only 
states of zero momentum contribute on the left-hand side and so we have the 
following result. 


For every continuous global symmetry of the Lagrangian which is 
spontaneously broken, there exists a state with energy EF — 0 as the 
momentum k — 0 (Goldstone’s theorem). 


This state corresponds to a massless particle if we have the relativistic relation 
between energy and momentum. The theorem is valid even for nonrelativistic 
situations such as those that occur frequently in solid state physics. The field 
excitation corresponding to this state is called the Goldstone mode. For the 
Goldstone state |G), it also follows that we must have 


(0|A(0)|G) # 0, (0|J°(0)|G) #0 (12.49) 


We may thus consider the current J° or the operator A as generating the 
Goldstone state. The field corresponding to the Goldstone particle is a 
Lorentz scalar. This follows from the fact that it is created by the current and 
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the parameters of internal symmetry transformations are scalars. If a Lorentz 
tensor gets a nonzero vacuum expectation value, we would break Lorentz in- 
variance; therefore, in a relativistic theory, B and A are Lorentz scalars. The 
Goldstone state, created by A, is also a scalar and hence a spin-zero boson. 
B and A need not be fundamental scalar fields; they can be composite opera- 
tors. For example, for the spontaneous chiral symmetry breaking in quantum 


chromodynamics, they are of the form @y°q or gq, where q are the quark 
fields. 


12.4 Coset manifolds 


Consider a compact Lie group G and a subgroup H C G. We define a quotient 
or coset G/H as given by all elements g of G with the identification g ~ g’ for 
all g, g’ € Gif g = 4g’ h for some element h € H. In other words, elements of G 
differing by an element of H are considered equivalent and correspond to the 
same element of G/H. Just as a Lie group G is a differentiable Riemannian 
space, so is G/H. The Lie algebra of the generators T4 of G can be split 
into the subalgebra of generators t® of H and an orthogonal set of generators 
S". By considering infinitesimal elements of G of the form g = 1 — iT464 = 
1 —it*6% —iS*0* = (1—iS*6*)(1—it*0%) = (1—iS*6*)h, we see that S* define 
the coset directions near the identity element; in particular, dim(G/H) = 
dimG — dimH. 

For many applications we are interested in functions on a coset space. 
These can be obtained from functions on G. Consider the set of all functions 
on the Lie group G. We denote the matrix representation of an arbitrary 
group element g by D2, where R denotes the irreducible representation and 
m,n are matrix labels taking values 1,2,---,dimR. The functions D2, are 
called the Wigner D-functions. For the angular momentum algebra, these are 
of the form D?,,,(0) = (j,m|exp(iJ*6°)|j,n), where |j,m) are the standard 
angular momentum states and J® is the angular momentum operator. The 
standard completeness theorem for groups says that, for a compact Lie group 
G, the functions {D%,,} form a complete set, where we include all unitary 
irreducible representations (which are also finite dimensional). In other words, 
we can expand an arbitrary function on the group as 


a (CEn Pe) (12.50) 
Rym,n 


Functions on the coset space G/H, by definition, must be invariant under 
g — gh, for elements h of the subgroup H. We can obtain a basis for 
functions on G/H by restricting to the subset of D’s which obey 


Ht (gh) = eats) (12.51) 


This means that the possible choices for the right index n in DZ, must 
correspond to states which are singlets of H € G. 
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On a Lie group there is a natural Riemannian metric known as the Cartan- 
Killing metric. With g denoting a general element of G, g~!dg can be ex- 
panded as g~'dg = —iT4E/(y)dy!, where yp’, I = 1,2,---,dimG, are the 
parameters of the group. BA are called the frame fields for the space G. 
We may also write this as E4 = Efdy! = 2i Tr(I’4g7'dg), where we have 
used the normalization Tr(T4T?) = 464%. The Gartan-Killing metric can 
be given as 


ds* = —2 Tr(g~ ‘dg g”*dg) 
= EA4B4 
= EAE4 dy! dy’ 4 (12.52) 


(It should be kept in mind that any parametrization of G is only a choice of 
local coordinates on G; in general one cannot find a single global coordinate 
system.) 

A set of frame fields for the coset space can be defined by E* = 
2i Tr(S*g—1dg); the metric for the coset is then given by 


ds? = E"E? 
= Gij(p)dy*dy? (12.53) 


For h € H, we find E*(gh) = hi E)(g) where h’, is H-transformation matrix 
in the representation to which the S* belong, hS*h~! = hiss . The metric ds? 
for the coset is invariant under this, ds?(gh) = ds*(g), so it can be taken to 
be independent of the parameters corresponding to H C G. The remaining 
parameters, the y’s indicated in (12.53), are local coordinates for G/H. 

We now give some examples of coset spaces. 


1. SU(2)/U(1) 


The space SU(2)/U(1) is the usual two-sphere S?. A general element 
of SU(2) can be parametrized as g = a + ib’o; where o; are the Pauli 
matrices. The condition of unitarity and the conditon of unit determinant 
give a? + b'b’ = 1 with a,b’ real. Thus SU(2) is topologically S*. Define 
a U(1) subgroup by the o3-direction by a general element h = a+ ib’a3 
with a? + b’? = 1. We write b' = b’é*, which gives the remaining parameters 
as €' obeying é*¢' = 1 corresponding to a two-sphere S?. Another way to 
parametrize g is 


ei lean eix/2 
ies Wagar e i) ( 0 e-ix/2 (12.54) 
By computing g~ ‘dg and identifying E*, we get the coset metric as 
dz dz 
ds? afin (12.55) 


~ “C+ zz? 
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This is the standard metric for the two-sphere; the usual parametrization in 
terms of 6,” is obtained by z = tan(6/2)e"”. 


2. Other coset spaces 


There are many other examples of coset spaces. Two simple sets which 
are spheres of various dimensions are SU(N)/SU(N — 1) = S?N-! and 
SO(N)/SO(N — 1) = SN~!. Another interesting example is the complex 
projective space CP"~!, which is given as SU(n)/U(n — 1). We will not dis- 
cuss how the identification of the spaces corresponding to these cosets can be 
made; some of it will become clearer later in this chapter. 


12.5 Nonlinear sigma models 


Consider a set of fields yp“ (a) which take values on some Riemannian manifold 
M. In other words, y“ are coordinates on M. y4(z) then provides a mapping 
from spacetime to M, 

pois): RO uM 


The space M, into which the mapping is done, is called the target space. 
Let Gap(y) denote the Riemannian metric tensor on M. The action 


S= [es 5GaB(—)ovAd*y® (12.56) 


defines a nonlinear sigma model on M. In Chapter 8, we have already dis- 
cussed how to set up a functional integral for sigma models. 

In our present context we are interested in target manifolds which are 
cosets of groups. In this case, using the form (12.53) of the metric, we can 
write 

o= -of? fds Tr(S*g~10,.9) Tr(S*g10%g) (12.57) 
f is a quantity with the dimensions of mass; it will play the role of the 


coupling constant. An important particular case is when we have G itself, or 
H =1. This is called a principal chiral model and the action is 


S= -p fate Tr (97 Oug g '0"g) (12.58) 


12.6 The dynamics of Goldstone bosons 


The dynamics of Goldstone bosons is given by a nonlinear sigma model with 
G/H as the target space. We will illustrate this first by an O(V) theory. As 
noted before, the action is of the form 
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om [es Fae a vo] (12.59) 


The O(N)-transformations are given by Ya — yy = Rasy, where Rap are the 
matrix elements of an (NV x N)- rotation matrix. The potential V(y) is taken 
to be invariant under these. Further, V(y) will be taken to have a minimum 
at Ya = (ba) with (a) (¢a) # 0. This will be the vacuum expectation value 
in the quantum theory. As a particular choice we may take 


0 
(ga) = Va = | (12.60) 


Vv 


where we have represented vg as the components of a column vector. Ro- 
tations on the first N — 1 entries which are all zeroes do not change this 
vector; so the little group of this vector is O(N — 1). Thus the vacuum expec- 
tation value (12.60) will spontaneously break the O(N) symmetry down to 
O(N — 1). The O(N — 1)-transformations are rotations matrices of the form 


h= ( ; ) (12.61) 


where r is an arbitrary (N—1) x (N—1)- rotation matrix. Clearly, hapvp = Va- 

One can take the vacuum value (¢,) to have some other components 
nonzero, rather than (¢y) as we have done in (12.60), a different orientation 
of (¢) in N-dimensional space, but this is equivalent to an overall O(N)- 
rotation of (12.60). The minimum of the potential fixes (¢,)(¢.2), so that 
any other choice must be an O(N)-rotation of (12.60). For such a choice 
(¢’) = Ro(¢) for a specific matrix Ro, the little group elements are of the form 
RohRo*, where h is given in (12.61). Thus the little group is still isomorphic 
to O(N — 1), but its particular form is a similarity transform of the previous 
one. As emphasized in our discussion of the U(1) case, we have to make a 
choice of orientation for the vacuum value, but physics is independent of the 
particular choice. 

The fields Ya, as fluctuations around the vacuum value can be parametrized 


as 
Ya = Rav(y)% (12.62) 
where 
0 
0 
Va = j (12.63) 
0 


v + p(z) 
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Rap is a rotation matrix which is spacetime-dependent; it can be thought 
of as made up of the components ya. Since hasvy = va (and this is true 
even if the O(N — 1) matrix r depends on the spacetime coordinates), we 
find that Ra» and (Rh)qs lead to the same fields in (12.62). Thus only the 
parameters corresponding to O(N)/O(N — 1) will appear in the expression 
for Y,. For any z, y* = Yaa is the same for any R; thus the degrees of 
freedom contained in R correspond to the field degrees of freedom where 
2 is fixed. Since there are N y’s, this means that we have the degrees of 
freedom corresponding to an (NV — 1)-dimensional sphere S$ ~—!, which shows 
that O(N)/O(N —1) = S*-1 
We now substitute the parametrization (12.62) in the action and obtain 


S= / d‘x E (Op)? —V(u+p) — i o7(R10,R R-10"R)o| (12.64) 


The potential energy V depends only on p and generally gives a mass term 
for this field. The term involving the field R gives the kinetic term for the 
(N — 1) Goldstone bosons; they are massless as expected. There is also an 
interaction term between the Goldstone particles and p since we have @ in 
the term involving R. At energies which are low compared to the mass of 
the p-particle, only the Goldstone particles can be excited and the action 
describing their dynamics is 


S= 5 | d*x vT(R-10,R R-0"R)v (12.65) 


Expanding R—!0,,R as -iT4E#0,.y® we are led to the expression v7 T4T 8 v. 
Tv = 0 for all generators in the little group O(N — 1), so only the generators 
in the orthogonal complement can contribute. These are matrices of the form 
(T*)ab = (S*)ad = $(5iaddn — 5idan), 80 that v7 S*SIu = 46", The action 
(12.65) can be written as 


S= ; ‘ig i d+z EB’ Eid, p°0"y? 
7 : f? / de Gal(v)d,e°" (12.66) 


where f = v/2. The dynamics of the Goldstone bosons is thus described by 
a nonlinear sigma model with the target space G/H. 

The action (12.65) obviously has the full global G-symmetry, where R — 
M Rand where M is a constant O(V)-rotation matrix. This is to be expected 
since the breaking is only spontaneous, i.e., by choice of vacuum state. If we 
make an infinitesimal variation R —> (1 — iT7464(z))R, the variation of the 
action is 


6S = i d+z i v™ (R71T40,,R)v 0404 (12.67) 
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(We have used the antisymmetry of T4.) This shows that the action has 
the invariance for global G-symmetry when the parameters 64 are constant. 
Further it identifies the current associated with this symmetry as 


J, =u (hoes (12.68) 


Also we see that the equation of motion for the fields in Ris just the conser- 
vation of this current. 

Our discussion so far has been classical. In quantum theory, even at low 
energies for the external particles in a scattering process, we cannot neglect 
the massive particles; they can contribute as internal lines in loop diagrams. 
Secondly, there are interactions among the Goldstone bosons contained in the 
action (12.66) because there are many nonlinear terms in general. Thus there 
are loop contributions due to the Goldstone bosons as well. There are many 
types of corrections to the quantum action from the loop diagrams. First of 
all, there will be many corrections to the terms involving the p-field. Such 
terms in the effective action I’ will not be important for low-energy processes 
which do not involve p. Secondly, the addition of loop corrections will replace 
the parameters of the action (in particular the potential V and hence the 
vacuum value) by renormalized parameters. Following through the simpli- 
fications above, we see that the effect is to replace f by some renormalized 
value. Finally there will also be many terms involving higher-order derivatives 
of the Goldstone fields. To quartic order in the derivatives, we can have, for 
example, terms like (v7 R~10, RR~10"Rv)?, (v7? [R~10,,R, R-10,R]v)?. Such 
terms do occur and must be included in I’, but their contribution to scatter- 
ing processes at low energies is small compared to the leading quadratic term, 
since the derivatives become momenta of the particles involved. Thus at low 
energies, they can be neglected too. (f is the scale parameter in the action, so 
this means that we are looking at k/f < 1 for typical momenta k.) Summa- 
rizing, we can conclude that the low-energy effective action for the Goldstone 
bosons for spontaneous breaking of a continuous symmetry G down to H 
is given by the G/H-sigma model. The parameter f is, in principle, calcu- 
lable from the full theory, but one can discuss the dynamics of Goldstone 
bosons using the sigma model, taking the value of the one parameter f from 
experiment. 

Even though we have used a Lagrangian model with scalar fields, the re- 
sult regarding the dynamics of Goldstone bosons is quite general. Thus we 
may have a fermionic theory with symmetry breaking via a composite oper- 
ator developing a vacuum expectation value, for example, (0|¢q|0) # 0. The 
Goldstone bosons themselves are then bound states of the fermions. Never- 
theless, the low-energy effective action for the Goldstone particles is given by 
a G/H-sigma model. This follows from the fact that the Goldstone particles 
are created by the current, namely, (0|J°(0)|G) 4 0. The only expression for 
(0|J,.(x)|G) consistent with this, as well as Lorentz invariance and current 
conservation, is 


12.7 Summary of results 253 


(0| J} (x)|G, k) = (constant) k,, (12.69) 


(This is consistent with conservation since k? = 0.) This equation tells us 
that the components of the current corresponding to the broken generators 
produce y’s from the vacuum, irrespective of whether they are bound states 
or not. For the current itself, we must, therefore, have a relation of the form 


Ji = f?d,0' + O(y?) (12.70) 


for some constant f, and y represents the Goldstone field. The action which 
gives a current of this form is 


S= Sf f ats apo + 0() (12471) 


The symmetry transformation for the broken symmetries is of the form y* > 
y' +0", for small 6*. Since we only have spontaneous breaking of the symmetry, 
i.e., symmetry breaking only in the choice of the vacuum state, this action 
must have the full G-symmetry. The terms which are higher order in y’s can 
be thus fixed by G-invariance. This leads immediately to the sigma model. 
In other words, our result for the low energy dynamics of Goldstone bosons 
is quite general. 

Finally, we note that, if we parametrize R as R = exp(—iT4y4) = 1 — 
iT 494 + O(y), we can simplify the expression for the current (12.68) of the 
sigma model as 

Ji, = f’ dup" + O(¢?) (12.72) 


This is for the generators corresponding to the broken ones. This result is in 
agreement with the general formula (12.70). For the O(N — 1) directions, we 
do not get a term linear in the fields since the unbroken generators t® vanish 
acting on the vacuum expectation value v. The corresponding currents are of 
the form - 

In = ff pa,~' + O(*) (12.73) 


where [t®%, S*] = if%7.9/ define the structure constants f%. 


12.7 Summary of results on spontaneous symmetry 
breaking 


1. Gis the symmetry group of the Lagrangian. It is also the symmetry group 
of the Hamiltonian. 

2. G\0)  |0) , H|0) = |0) implies that G is spontaneously broken down to 
Eee’G: 

3. This can be realized if minimization of energy levels leads to nonzero 
vacuum expectation values (0|A|0) # 0 where A transforms nontrivially 
under G and the isotropy group of the vacuum expectation value is H. 
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A. Transitions among different degenerate vacua do not occur in the limit 
of infinite number of degrees of freedom, (ala’) — 0. 

5. For every broken generator of a continuous global symmetry of the La- 
grangian, there exists a massless spin-zero particle (the Goldstone boson). 

6. The low-energy dynamics of the Goldstone bosons, including their mutual 
interactions, is given by a nonlinear sigma model with target space G/H. 

7. The current for the broken symmetries has the form Ji = f?0,¢'+O(9”), 
where y’ are the Goldstone fields and f is a constant. 


12.8 Spin waves 


The simplest example of spontaneous breaking of continuous symmetry is 
the ferromagnet. This is a spin system where we have a spin variable at each 
lattice site of a three-dimensional lattice. Neglecting other degrees of freedom, 
a good approximation for the Hamiltonian of the system is the Heisenberg 
Hamiltonian 


=~ X Jig Si Sj (12.74) 


where 1,7 refer to lattice sites and a denotes the spin vector at site 7. J;; 
is the so-called exchange integral, and this falls off for large separations of 
the sites; very often it is a good approximation to keep only the nearest- 
neighbor interaction in (12.74). (We use the letter H for the Hamiltonian 
and for the unbroken subgroup since this is conventional; it should be clear 
from the context what is meant.) The Hamiltonian (12.74) has symmetry 
under rotations, so we have an O(3) symmetric theory. The ground state is 
ferromagnetic with neighboring spins aligned along the same direction with 
a net overall magnetization if J; ;41 is positive and antiferromagnetic with 
neighboring spins having opposite orientation if J; ;,1 is negative. Either 
way, the ground state breaks the O(3) symmetry. For a ferromagnet, with 
the spins aligned, the ground state has net magnetization. There is a residual 
symmetry for rotations around the axis of net magnetization. Therefore the 
O(3) symmetry is broken down to O(2). In the limit of large lattices and for 
long-wavelength excitations, we can approximate this by a continuum field 
theory. The symmetry breaking O(3) — O(2) will give two Goldstone modes. 
These modes are the spin waves. Since O(3)/O(2) is S?, the dynamics of 
these modes is given by a sigma model with S? as the target space. There is 
no Lorentz-invariance here, so it does not make sense to write a relativistic 
effective Lagrangian. However, we can say that the effective Hamiltonian is 
of the form of a sigma model, i.e., 


H=— ff dx E oe (12.75) 


The field Z describes the spin waves. By expanding this powers of the field 
Z,Z, this Hamiltonian can describe the interactions among spin waves. 
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Quantum chromodynamics is the theory of strong interactions, the theory of 
quarks and gluons. It is an SU(3) gauge theory, with the gluons being the 
gauge particles and the matter fields being the quarks transforming in the 
fundamental representation of SU (3). The degrees of freedom associated with 
this SU(3) symmetry are called the color degrees of freedom. Thus each quark 
field has a color index which takes values 1, 2,3, the different components 
transforming into each other under SU(3) gauge transformations. There are 
six species of quarks known, denoted by u, d, s, c, 6, and t, called the up, 
down, strange, charm, bottom, and top, respectively. We will denote these 
by Q?,, a = 1,2,---,6, respectively. i is the color index taking values 1, 2, 3. 
The up and down quarks have masses of the order of a few MeV, the strange 
quark has a mass around 150 MeV; these are the light quarks. Masses for 
charm, bottom, and top are, approximately, 1.2 GeV, 4.2 GeV, and 174 GeV, 
respectively; these are the heavy quarks. The quarks get masses from the 
spontaneous breakdown of the gauge symmetry of electroweak interactions; 
the scale for this is approximately 246 GeV. Thus for the dynamics of quarks 
at energies well below this scale, we can take a Lagrangian where the masses 
are included in by hand. The QCD Lagrangian then has the form 


1 a ypapv Fi (5 j 
L=-7FLL PM +) Qa(i7: Diz — mabiz)Q (12.76) 


where the covariant derivative is given by 
(Dy)ig = (Op + Ap diz 
= 0,613 — te AA (T) ij (12.77) 


Here Ad, A= 1,2,---,8 are the gauge potentials for the SU(3) gauge sym- 
metry. T4 are hermitian, traceless (3 x 3)-matrices which form the generators 
of SU(3) in the fundamental representation. e, is the strong interaction cou- 
pling constant. 

Consider the quark Lagrangian without the masses, namely, 


L= SQ, iy: Diz Q, 
= ss Ora vy ‘ Di; @ 3 + ye Ono vy . Di; a. (12.78) 
a a 
where in the second step we have split the quark fields into the left and right 
chiral components Q;, = 3(1+7°)Q and Qr = 5(1—7°)Q. This Lagrangian 
has a global U,(6) x Ur(6) symmetry given by 
Qh a Qin =a UapQ6 
Qa = Qita aa Vop Ong (12.79) 
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where U, V are unitary (6 x 6)-matrices. This is the chiral symmetry of 
strong interactions. The mass terms do not have this symmetry; so it would 
seem that this symmetry is not very useful. However, for the light quarks, 
this chiral symmetry is very useful as an approximate symmetry, since the 
explicit breaking of chiral symmetry due to the masses is small compared to 
the relevant scale of the gluon interactions. So we write QCD for the three 
species (or flavors as they are called) of quarks by restricting a, @ to the 
values 1, 2,3 corresponding to the up, down and strange quarks. Taking the 
a, 2 indices as understood, the quark part of the Lagrangian is 


L= Qriy DQi + Qriy -DQR- (Q.M!QpR + QrM!Qz) (12.80) 


where M? is the quark mass matrix 


en (0 0 
Mi = Ota titg (12.81) 
0 QO ms, 


The symmetries of this Lagrangian are as follows. 


1. The color gauge symmetry corresponding to 


Qi Qe =95(2) %, 
Ay A, =g9(z) Ap g (2) — Ong 9? 
(12.82) 


where g(x) is an SU(3) matrix. This symmetry is not broken but leads to 
confinement of quarks and gluons. The spectrum of the theory has only 
bound states of quarks and gluons which are singlets of this symmetry. 

2. Discrete symmetries like parity, charge conjugation, and time-reversal 
are respected by this theory. Quantum effects related to the topology 
of the gauge field configuration space and instantons lead to a possible 
PT-violating term, the so-called 6-term. This is discussed in Chapter 16. 
Experimentally, 9 < 10-9. The theoretical reason for the smallness of 
this value is not currently clear; nevertheless, based on experiment, the 
PT-violation due to this effect can be taken to be zero. 

3. There is the U(3)z x U(3)r chiral symmetry given as in (12.79). At the 
algebraic level of infinitesimal generators, U(N) ~ SU(N) x U(1); so 
we will discuss the U(1)’s separately. We have SU,(3) x SUp(3) sym- 
metry. The SUy(3) subgroup of this symmetry group is defined by the 
transformations 


Qa = UapQis 
Qra = UnpQre (12.83) 


In other words, SUy(3) corresponds to U = V subset of (12.79). 
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4. The U(1) transformations are given by 


U(1)L: Qi, =e" Q1, Qr = Qr 
U(1)r: Qt = Q2, Qr=e'"Qr 
(12.84) 
We can form combinations Uy (1), U4(1) defined by 
Uy (1) : 0, = ”9Q,, Qh = e*/Qp 
Ua(1): A, =e"Qn, Qr=e “Qr 
(12.85) 


Uy(1) is even under parity and its conserved current is a vector. It can 
be identified as the baryon number. U,4(1) is odd under parity and its 
current is an axial vector. 


The chiral symmetries are broken in many ways. 


1. The U(3); x U(3)pr chiral symmetry is spontaneously broken down to 
U(3)v ~ SUy-(3) x Uy (1) by the effects of the gluonic interactions (the 
gauge field A”). The energy scale at which this happens is approximately 
140 AfeV. The effective Lagrangian for the Goldstone bosons due to this 
symmetry breaking is valid up to approximately 1 GeV since there are 
some additional numerical factors in the scattering amplitudes. It is a 
theorem, due to Vafa and Witten, that vector-like symmetries cannot be 
spontaneously broken in a theory with vector-like coupling between gauge 
particles and the fermions. So SUy(3) x Uy(1) is the smallest group to 
which the chiral symmetry can be spontaneously broken. 

2. The mass terms break the chiral symmetry explicitly. If M, = mol, ie., 
all the quarks have the same mass, then SUy(3) x Uy(1) is the unbro- 
ken subgroup. If the quark masses are different, then SUy(3) is further 
broken. This breaking due to masses is explicit and ultimately comes 
from the electroweak interactions since the quark masses are generated 
by coupling of quarks to the electroweak Higgs field. 

3. There is further explicit breaking due to electroweak interactions coming 
from the coupling of electroweak gauge bosons and the quarks. Among 
other effects, this leads to the mass difference between the + and 7° 
mesons. 

4. The axial U4(1) symmetry is explicitly broken down to the cyclic group 
Ze (Zan, for Ny species or flavors of light quarks) by anomalies which 
are due to quantum corrections. 


The baryon number symmetry Uy(1) is not spontaneously or explicitly 
broken by the strong interactions; it is actually broken by anomalies in the 
electroweak sector and B — L, baryon number minus lepton number, is the 
only global symmetry which is obtained in the standard model of particle 
interactions. 
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The hierarchy of these breakings is also important. The strongest break- 
ing occurs for the spontaneous chiral symmetry breaking and the anomalous 
breaking of U,4(1). The energy scales for these are comparable and of the or- 
der of 1 GeV. The explicit breaking of chiral symmetry due to masses is next 
in order, followed by the breaking of SUy(3) due to mass differences. Elec- 
troweak interactions come next and the weakest breaking is that of baryon 
number Uy(1) due to electroweak anomalies. 

The broken generators for the first breaking SU,(3) x SUR(3) — SUy(3) 
are axial in nature and this leads to eight Goldstone bosons which are pseu- 
doscalar. These correspond to the 7*, 7°, K+, K°, K°, and n mesons. Be- 
cause of the explicit chiral symmetry breaking due to masses, we are start- 
ing from an imperfect symmetry, and as a result, these mesons are not true 
Goldstone bosons; they are actually massive and are called pseudo-Goldstone 
bosons. Their masses should go to zero as the electroweak couplings and the 
quark masses are taken to zero. Further, since the explicit breaking scale is 
smaller than the spontaneous breaking scale, it is still useful to think of them 
as Goldstone bosons, with masses added to the effective action. The masses 
are indeed significantly smaller than the scale of the spontaneous breaking 
of approximately 1 GeV, so this is consistent. There are also mass splittings 
among the Goldstone bosons because SUy(3) is not a perfect symmetry ei- 
ther. 

SUy(3) with breaking effects due to mass differences is an approximate 
symmetry of the theory, realized in the Wigner mode, and the states (mesons 
and baryons) can be classified by this. The hierarchy of breakings also tells us 
why it is not useful to include the heavy quarks in this analysis. Their masses, 
which are higher than the 1 GeV scale of spontaneous breaking, explicitly 
break the chiral symmetry so badly that it is not useful to think of a chiral 
symmetry for them, even as a first approximation. 


12.10 The effective action for chiral symmetry breaking 
in QCD 


We can now construct an effective action for the pseudoscalar mesons and for 
the baryons. The dynamics of the mesons should be governed by a nonlinear 
sigma model with G/H as the target space, where G = SUz,(3) x SUpR(3) 
and H = SUy(3). We can represent an element of SUz,(3) x SUR(3) by 

= (G,,Gr) and the vector subgroup corresponding to the transformations 
(12.83) by h = (9,9), where Gz, Gr, g are (3 x 3)-matrices which are 
elements of SU(3). Since g is arbitrary, we can write g = GLV for some 
bag sede’ V of unit determinant. We then regard V as the variable in 
h = (GLY, GLV), so that by considering all V € SU(3) we get SUy(3). In 
the coset, we Gdentify the elements G and G h. 


h = (GiGRV, V) = (GrGR,1)V,V) - (12.86) 
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This shows that the coset can be represented by the SU(3)-matrix U = 
Gr ew In other words, 


SUz(3) x SUR(3) 


aay Lee (12.87) 


The Goldstone particles, the mesons in this case, can be represented by an 
SU(3) element U. An element (gz, gp) of the chiral symmetry group SU;(3)x 
SUpR(3) acts on Gy, and Gp by translating them to new elements of the group 
as (gLGi,gRGR). Their action on U is thus of the form 


U-+U'=g, U gi, (12.88) 


Choosing U = 1 to represent the vacuum, namely (0|U|0) = 1, shows that the 
isotropy group is gz = gr, which is the vector subgroup of transformations. 
(We can choose U to be any constant matrix in the vacuum, the orientation of 
the vector subgroup in G is slightly different compared to (0|U|0) = 1, that is 
all.) Following our general discussion, the effective action for the pseudoscalar 
mesons is given by 


2 
- fn | d*z Tr(d,U'04“U) (12.89) 


We can parametrize U as 


ae (14) 0-i2e) (12.90) 


where IM is a hermitian traceless matrix. The effective action becomes 


M? M?2\~* 
4 ps 
S= pass —y C4 0,M (1+55) fs) Mm 


=f de | 5T(O.Ma"M) - sa T(M79,MonM) + 


(12.91) 


The identification of the physical meson fields is given by M = t?¢%/V2, 
where ¢% are the meson fields and t® for a basis for the SU(3) algebra; they 
are traceless hermitian matrices normalized as Tr(t*t?) = $6°°. Written out 
as a matrix, 
ym + en nt Kt 
M= a —jgm + en ~«K° (12.92) 
ew kK —/2 n 

where we have used the particle names for the fields representing them. (Ac- 
tually the 7 in this equation is not quite the physical 7 because of mixing 


260 12 Spontaneous symmetry breaking 


with the SU(3) singlet pseudoscalar meson; we will not discuss these details 
here.) The eight fields here form the octet representation of SUy (3). 

The quartic terms in (12.91) can describe the scattering of the mesons. 
The mesons interact with the baryons as well. This can be included as fol- 
lows. The baryons also transform as representations of SUy(3); they are 
made of three quarks each. In terms of SUy(3) indices, they are of the form 
Qe Lt ein. This is antisymmetric in the color indices; so we can have the 
fully symmetric representation for the flavor indices a, 3,7, which is also 
symmetric in spin, in accordance with the exclusion principle. This gives a 
(ten-dimensional) decuplet of spin-2 baryons. Another possibility is to anti- 
symmetrize a pair of flavor indices and corresponding spin indices, giving an 
octet of spin-3 baryons. The lowest mass baryons are the octet; they are of 
br um = 
the antisymmetrization gives spin-. The particle identification is given in 
terms of BA = $ Bagye*7, 

cial i ve“ 1 ae 1 4 
i aw Aes a (12.93) 
= —\/2 A 
with a similar structure for the antibaryons. The baryon kinetic energy term 
should be BSiy - OB2. 

As for the meson-baryon coupling and baryon mass term, they have to be 
determined so that the Lagrangian has the full chiral symmetry and symme- 
try breaking should be apparent only when we expand U around the vacuum 
value 1, consistent with the fact that the chiral symmetry is broken only spon- 
taneously. The left-chirality spinor is a two-component spinor y,, r = 1, 2, 
which transforms by the (2 x 2)-matrix representation of Lorentz transfor- 
mations. Since this matrix has unit determinant, we can form invariants by 
contracting the spinor indices with e"*. Thus a left-chirality baryon can oper- 
ator can be of the form Q;Q,_Q_, with two Q,’s forming a Lorentz-invariant 
combination or Q;QrQr with the two Qr’s forming an invariant. To illus- 
trate how we can form interaction terms, we will use one combination, say, 


Q1Q1Q_, in what follows. Since €461gae’9@p' = €a'B'y' Ole, for the baryon 
matrix, the transformation rules are of the form 


the:foumeBuem=iO54( 


; k — : 
his ay: =)€ijk, Where n,7,s are spin indices; 


by 
> 
II 


Peel ceed 
= 
— 


B,—grBg, Br—grBgl, (12.94) 
It is easily checked that the baryon mass term Tr(B, Br + BrBrz) is not 


chirally invariant. By use of the transformation property of U, an invariant 
term can be constructed, leading to the effective baryon action 


Sp = | ds 


Tr(B iy-0 B)+ gf, Tr (BLUBRU' + BaU'B,U) 


(12.95) 
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Here g is some constant. By writing this out in terms of the fields, and using 
the expansion for U, we then find the baryon mass to be mp = Ofn and 
the pion-nucleon coupling to be 9g, = g. The pion-nucleon interaction 
igNy°r*Nr® is the original Yukawa interaction, slightly modified to take 
account of the pseudoscalar nature of the pion. The relation mg = g,0n fr 
is known as the Goldberger-Treiman relation. (This is to the lowest order; 
there are corrections to this relation, related to form factors for the axial 
vector current.) 

So far we have not addressed the explicit breaking of chiral symmetry. 
The most important consequence is the mass term for the mesons. This is of 
the form 


2 
Smore = f ate TEM (U +04 -2) 
1 
~ pes [- 3M) + MM") oe | (12.96) 
The matrix M should have a structure similar to the quark mass matrix 


to incorporate the pattern symmetry breaking due to the quark masses and 
mass differences. We can take it to be of the form 


a 0 
M=10 a O (12.97) 
0 b 


with a = a’ because the up and down masses are approximately the same, 
and b > a,a’. The as yet unknown coefficents a, a’, b may be related to the ex- 
perimental meson masses. Expanding out (12.96) and identifying the masses 
in terms of a,a’,b, we get one of the Gell-Mann-Okubo mass formulae, 


3m2~ 4me—m, . (12.98) 


which is in reasonable agreement with the experimental values. There are 
other sources of symmetry breaking as discussed earlier, and some of these 
can also be incorporated nicely in the effective action; we will not discuss 
them further. 

As another example of the use of the effective action, we consider the 
™ — 7m scattering at low energies. For this purpose we can neglect all the 
K,7 terms in (12.92) and write M = 7 - 1/2, where the r* are the Pauli 
matrices and 7+ = (7! tim?)/./2 and 1? = 7°. There is an obvious SU(2)y 
subgroup which acts on these fields; it is called the isospin symmetry. We use 
7 for the Pauli matrices because here they are applied to the flavor isospin 
symmetry. The terms in the effective action (12.91) which are relevant for 
ma — 7m scattering are 


1 
Sint = alt [r- Oy» OP — an mot WT (12.99) 
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We consider pion-pion scattering with the isospin labels and momenta as 
(kya)(kob) — (k3c)(kad). The scattering amplitude for the process is given 
by ; 


(k3c, kad|S|k,a, kob) = 54) (ky im ka ~ ks —ks) A 


laa may 
A =f (: a =) + baba (SP a) 


‘ 2 
+5ad5be (: “P| (12.100) 


fi 


where we have introduced the so-called Mandelstam kinematic variables, s = 


(ky + ko)?, t = (ky — kg)”, u = (ky — kg)”, which also obey s+ t+u = 4m?. 


By taking a = b = c = d = 3, we get the amplitude for 197° — 19770 


as A = im?/f?. For mtx” => 197, we need a = b = 1,a = b = 2 and 
a=1,b=2, c=? bes andveaode 3. Tha caomeae eae 

It is also interesting to work out the currents for the chiral symmetries. 
We find 


= Fen, 0 
Ly a 9 (t BL ) 
2 
iF T(CU Od, U) 
Wu = Sint Jiu = femme (U~*d,U — ,U U~*) 


Jin Si, — Jay = ie (U-*d,U +0,U U~*) (12.101) 


When these are expanded in powers of M, the terms involving just the pion 
fields are 


1 Ht 
Jin & gfrOur" - sem On ++: 
it 1 
Tiny © ~ a fnOun" - yom Our” Pisce 
Ji, &® eA eo +. 
Jan © frOym® +++ (12.102) 


Notice that the formula for the axial vector current, which corresponds to 
the spontaneously broken symmetries, is in agreement with the general result 
(12.70). With the one-pion state |kb) we get 


e tke 


V 2WEV 


(0|J4,,|kb) = —if,-ky,d” (12.103) 
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If J%,, is conserved, the above relation gives k? = 0 as is appropriate for 
massless particles. In the present case, the current is not conserved because 
of the explicit breaking due to the quark masses. From the above relation 
—tker 


0/0, J%"|kb) = —m? f,.62°-<____ 
(0/0, 4" | ) mf iif 


This is known as the PCAC (partial conservation of axial vector current) 
relation. 


(12.104) 


12.11 The range of validity of effective Lagrangians and 
unitarity of the S-matrix 


The effective action for chiral symmetry breaking in QCD captures the low- 
energy features of the theory. This action was written down based on symme- 
tries, although, in principle, such an action can be derived starting from the 
QCD action. There are many other situations where such an effective low- 
energy action can be constructed and used for many calculations. In some 
sense, all actions we use are of this nature, since the high-energy behavior of 
the standard model of particle interactions has not been tested beyond a cer- 
tain point, approximately a TeV. Clearly there is a certain range of validity 
for the use of any effective Lagrangian, since it is based on a low momentum 
approximation. For scatterings and other processes beyond a certain cut-off 
value of momenta, the effective Lagrangian will have to be modified. Also, 
generically, an effective Lagrangian will involve terms which are not renormal- 
izable. Calculations have to be done with a cut-off. One can then ask whether 
this cut-off value can be estimated starting from the effective Lagrangian it- 
self. This is especially important in situations where we do not know the 
high-energy version of the theory, having obtained only a phenomenological 
theory based on low-energy experiments. We will now show how the unitarity 
of the S-matrix provides one way to make an estimate of the cut-off. 
As shown in Chapter 5, the scattering operator is given by 


S = U(co, —0o) 


0 


x 
U(x, y°) = T exp | dioxin .) (12.105) 
y® 


U is a unitary operator and this leads to the unitarity condition StS =1. 
This condition is a statement of the conservation of probability in the theory. 
The fields in £Ln¢ are the free in-fields. 

The unitarity of the S-matrix gives certain general constraints on cross 
sections, decay rates, etc. For example, consider the scattering of two particles 
of momenta k1,k2 into any number of particles. The initial state is given by 
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|i) = |ki, ko). For simplicity assume that incoming particles are not identical 
particles. By writing S = 1+ 7, the unitarity condition becomes 


(T-T)+T'T=0 - (12.106) 


Taking the matrix elements of this relation for the state |i), and using the 
completeness relation for the states, 5> rif \(f| = 1, we get 


S- MFT? = -4 (a\(T — TIA) (12.107) 
i 


(Here we will assume that the forces are of short range, so that there is 
no divergence in the forward scattering amplitude.) (f|T'|z) is the transition 
amplitude for the initial state |) to be scattered to the state |f). It is of the 
form 


(fIT |e) = Fay on oe—h— o) Mal ay 


TRG 7 Teer ev 
(12.108) 


where M,; is the invariant amplitude. Calculating the total cross section o 
from here, we find 


= [f(T |é) |? —orT Vv (hi ka)? — mim3 (12.109) 
af 


Wh, Wk. V 


where 7 is the total interaction time and we have used the formula for the 
flux (5.52). 

n (i|T|z) , we have the same initial and final states, so that the momentum 
conserving 6-function should be replaced by Vr as explained in Chapter 5. 
We can then write 


i 


i 
= 
(i|T |i) = 3a, V dunn, V Vr Mi (12.110) 


M,; is the invariant amplitude for the (elastic) forward scattering since the 
final state is the same as the initial state. Using results (12.109) and (12.110) 
in the unitarity conditon (12.107), we get 


(Mi — Mj,) 
(ky - ko)? — m?m2 
ImM i; 
2 


[(s — m3 — m3)? — 4m?m3]? 


(12.111) 


where s = (k, + k2)*. Equation (12.111) between the total cross section 
and the imaginary part of the forward scattering amplitude is known as the 
optical theorem. 
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Unitarity also imposes restrictions on how fast cross sections can grow 
with energy. The general way to demonstrate this explicitly requires finding 
a basis which partially diagonalizes the scattering matrix, such as an angular 
momentum basis which corresponds to the partial wave analysis of the ampli- 
tude. Here we will do a simpler analysis just to illustrate the point about the 
unitarity constraints. Consider the elastic scattering of two spinless particles 
with the momenta ki, kz — pi, po. We will use the center-of-momentum 
frame, in which case k; + kg = 0. The basic kinematic variables are thus 
s = (ki +k2)? and the scattering angle k, -p, = |k.| |p,|cos@. The invariant 
matrix element M is a function of s and cos@. Using the formula (12.108), 
we can calculate the total cross section as 


do SS sao ee 
- 2)? — mima (27)? 2s (27)? 20,5 


x(2m)45 (p, + po — ki — ke)|M? 


: |M|?d2, (12112) 


- 6472s 


where we have used the kinematic result 


[(s — mj} — m2)? — 4m2mi| 


12.113 
Ee ( ) 


Ip, |? = |po|? = 
The quantity ImM,; which occurs in (12.111) is the imaginary part of the 


elastic amplitude evaluated at 6 = 0. We thus have an obvious inequality 
(ImM,;)? < |M|?(@ = 0), which leads to 


6477s deel 
i ee (Fe) (12114) 
~ Its = me - m3)? og 4m?m3] dQ, 6=0 


We can now apply this line of reasoning to terms in the effective La- 
grangian which are of dimension higher than zero and hence non-renormalizable. 
As an example, consider a term like 


Lint =9 / d*x (Oy)*x? (12.115) 


where we have two scalar fields y, x of masses m; and meg, respectively. g 
is a coupling constant of dimension (mass)~*, analogous to the parameter 
1/f2 in the effective Lagrangian for mesons. To lowest order in g, the elastic 
scattering amplitude for y x — 9 x is given by 


M= 4g ky sy ehl (12.116) 


where ki, p; are the momenta of the y-particle before and after collision, 
respectively. For the cross sections we get 
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doe! = g’ 4 
ahem » Ants" 


au er 
Cet = | Wi + 3Pl (12.117) 


Notice that o., grows like s for large s. Since o,; < o; the inequality (12.114) 
becomes, for large s, 
9 . 48m? 
s* << ——. 
g 


The unitarity bound is not respected by the lowest-order calculation for s > 
/48nm7/g. The effective theory has to be replaced by a more fundamental 
theory, or at least a theory with more fields included, before we get to this 
energy. (This is a very approximate bound; more stringent bounds can be 
obtained by a more detailed analysis of unitarity.) In essence, this argument 
is valid even if loop calculations are taken into account. One can impose a cut- 
off on the loop momenta of the effective theory, construct an effective I’ which 
will be a series of terms with higher and higher dimensions. S-matrix elements 
can be calculated by solving the equations of motion for I’ and evaluating 
I on the solutions, following (8.115). The calculation is essentially what we 
have done here. Unitarity bounds can be viewed as giving constraints on the 
parameters of I’. 


(12.118) 


12.12 Gauge symmetry and the Higgs mechanism 


So far we have discussed the spontaneous breaking of a global symmetry 
where the transformation matrices are constant as a function of the spacetime 
coordinates. When we have spontaneous symmetry breaking for a symmetry 
which is local, in other words, for a gauge symmetry, Goldstone’s theorem 
does not apply. In this case, we get massive gauge bosons. This can be illus- 
trated by considering the U(1) gauge theory. 

The action for a U(1) gauge theory with a complex scalar field is 


s= | de 


where the gauge transformations are given by 


2 2 
~ 7 FavF#” + (Dud)"(D*d) —d (oe - 5) | (12.119) 


Ay > Ai, = Ay + 0,0(z) 
¢— ¢ =exp(ieb(z) ) o (12.120) 
This model is often referred to as the Abelian Higgs model. (In this section, 


we will use ¢ for both the classical and quantum fields, since we need to 
go back and forth in our discussion.) The energy corresponding to (12.119) 
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is minimized when ¢ has a value v/ V2 which is realized in the quantum 


theory, as before, as the vacuum expectation value. For fluctuations of the 
field around this value, we parametrize ¢ as 


1 
(x) = exp( ie£(x) ) —=(v + PAI 
(2) = exp ( ief(2) ) (0+) (12.121) 
Substituting this into the action (12.119), we get 


s= [dx 


ig 1 1 
~yFuF 7 3 (Qn00"p) 2p 50 Wwe a5 p)? 


i 
=v" p* — Aup* — a (12.122) 


where W,, = A, — 0,€. We can rewrite the theory in terms of W rather than 
A. Since the relation between the two is in the form of a gauge transformation, 
Fi (A) = Fu,(W). Thus, 


S = Sw +S, + Sint 
d. 
a je -7 pu (W)FPY CW) + sow," 


OF = [ae | 5(00) = 52m0?)p?| 
2 
Sint = pas [e2opw,.w ac =e WwW" — dup? — 7e'| (12.123) 


The particle content can be read off this action. We have a massive gauge- 
particle, of mass ev. The kinetic term for ¢ has led to this mass term for W,,. 
There is a massive scalar particle p with m? = 2\v?. Then there are a number 
of interaction terms. Notice that there is no massless Goldstone particle. The 
would-be Goldstone field is (x) in the parametrization (12.121). But, from 
the gauge-transformation property (12.120), we see that this field is like a 
gauge parameter. In fact, since @ and e*°’¢ are physically equivalent, we can 
even write the vacuum value of ¢ as e*°°u/,/2 for some function @ and choose 
@ appropriately to get rid of € from the field ¢. This is to say that it can be 
moved into the gauge potential by a gauge transformation, which is what we 
have done when it is absorbed into the vector field W,,. The action (12.123) 
is thus displayed in a particular gauge; it is called the unitary gauge. We 
can do further gauge transformations to write this in other gauges, when the 
W2-term will have the form (W + 06)?. This method wherein the would-be 
Goldstone field is absorbed by the gauge field, which then becomes a massive 
field, is known as the Higgs mechanism. 

It is interesting to see how the gauge symmetry is important in removing 
the massless modes using a Hamiltonian framework somewhat along the lines 
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of our discussion of the electromagnetic field in Chapter 6. We have already 
seen that we can choose a gauge where Ag = O and then split the spatial 
components as A; = AT + 0;f where A? is transverse, i.e., 0A? = 0. Using 
the parametrization (12.121), the @part of the action can be written as 


So= f d's = [é? - (ale - f) = APY | 


e*y? (. 
= f atc SF [@-(ae-pyr-aPat] (zag 


s 


where we have ignored the p-field since it is not important for this discussion. 
It is clear that the transverse part of A; has a mass ev. The equation of motion 
for € is 

E€-9,0E=0 ° (12.125) 


where € = £— f.. The equation of motion for the Ag-component, or the Gauss 
law for the theory, is 


8;(0;f) — e?u?& = 0 (12.126) 
This can be solved for €, in terms of Fourier modes, as 
i= | earl oe (12.127) 
Using this in equation (12.125), we find 
a2 eas 
k-k a +k-k+ ou & =0 (12.128) 


For k # 0 we see that w? = k- k + e*v?, showing that this mode is also 
massive, with the same mass ev. For this mode the result can be extended to 
k =0 by continuity. (It is possible to have a mode which is spatially constant 
which is related to the orientation of the vacuum as before.) 

We have already seen that the true gauge group of the theory is the set of 
all gauge transformations which go to the identity at spatial infinity, namely, 
G,. in the notation of Chapter 10. It should be emphasized that there is no 
breakdown of this true gauge symmetry here. In fact gauge symmetry is 
crucial in removing the massless modes. What is broken is the global part of 
the symmetry, corresponding to G/G,. To show how this works out in some 
detail, we will use a gauge-invariant field y. In terms of the Coulomb Green’s 


function Gc(x,y) we can construct a gauge-invariant combination of ¢ and 
Aj as 


x= em (-ie fay Go(e,y)@Aily)) ¢ (12.129) 


This is invariant under transformations which go to the identity at spatial 
infinity. Under transformations which go to a constant at spatial infinity, 
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x transforms as x — e*°°(%)y. Thus x is gauge-invariant but has charge 
since it transforms under G/G,. The potential V(¢) = V(x), so the vacuum 
expectation value of x is v/ V2. This nonzero value for xX is consistent with 
the invariance of the vacuum under G,. In terms of ¢ we can write 


(0|d|0) = exp (~i [ev cot. uA.) as (12.130) 
V2 
The parametrization of ¢ in (12.121) becomes 
as oe ‘I 
b= exp(ied) exp (~ie fay Go(maAiy)) Te(o+ 6) (12.481) 
Combining this with the Gauss law, we get 
i 

Sas pes 5 [p? — (Vy)? — e?v?y?] (125132) 


where 


k-k ~ 
yp = ev ko Boe E (127133) 


The fact that the vacuum is defined by (0|x|0) = v//2, or equivalently 
(12.130), shows that the G,-symmetry is not broken, even though we refer to 
this situation as the spontaneous breaking of gauge symmetry. The vacuum 
expectation value for x, however, does break the global part of the U(1) 
symmetry. 

The Higgs mechanism does not change the number of degrees of freedom. 
Before symmetry breaking, we have the modulus and phase of ¢ and the 
two transverse polarizations of the gauge particle for a total of four physical 
fields. After symmetry breaking, the gauge boson is massive and has three 
polarizations. Combined with p, this still leaves four independent fields. The 
would-be Goldstone boson becomes the third polarization of the massive 
vector particle. 

We will now consider how this extends to nonabelian symmetries. The 
mass term for the gauge fields arises from the kinetic term for the scalar field 
which is of the form 


Sp= f ax ((D.8)z(D"d)a— Vibe) (12.134) 
where ¢, transforms as some representation of the symmetry group G. The 


covariant derivatives are (Dud¢)a = Queda — ieAA(T) ado. When the vacuum 
is such that ¢ has a nonzero expectation value, we can parametrize it as 


ba = Jar(Z) alm + pb) (12.135) 
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The symmetry G is broken down to H C G. Since an H-transformation is 
identity acting on vy + pp, only the G/H elements really appear in (12.135). 
Substituting this in (12.134) and going to the unitary gauge, we get 


a= pes Ea v (EAP Ne WA Apu + p_ terme (12.136) 


where wa is the unitary gauge version of Ag The generators of the Lie 
algebra of H annihilate the vacuum value v*, so the mass matrix is nonzero for 
the coset directions. The gauge hosons corresponding to the broken generators 
get masses, with a mass matrix given by (M?)48 = g? v7 (T4T®)v. One can 
also easily check that the counting of degrees of freedom works out as well. 

Finally, even though we have used a scalar field to describe the symmetry 
breaking, it is possible, just as in the case of chiral symmetry breaking in 
QCD, for composite operators to get vacuum expectation values and so break 
the symmetry. The effective action for the Goldstone bosons then becomes 
the mass term for the gauge particles. 
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All the particles and interactions known to date, except for gravity, are de- 
scribed by the standard model. This is based on the gauge group SU(3) x 
SU(2) x U(1), where SU(3) corresponds to QCD and SU(2) x U(1) corre- 
sponds to the electroweak interactions. There are gauge bosons corresponding 
to these groups. The matter content falls into three types, the leptons, the 
quarks, and the Higgs scalar field. The electroweak SU(2) x U(1) symmetry 
is spontaneously broken down to a U(1) subgroup which is identified as the 
gauge group for electromagnetic interactions. The electroweak gauge fields 
couple differently to the left and right chiral components wz,p = 5(1 +7°)y 
thereby breaking parity symmetry explicitly. It is therefore easier to spec- 
ify the transformation properties of various fields after separating the chiral 
components. The covariant derivatives in general can be written as 


Dix = (a, — ie, AAT — igbtt* — idx) x (12.137) 


for any field y. AA. bY, Cy are the gauge fields corresponding to SU(3), SU(2), 
and U(1), respectively. e, is the strong interaction (QCD) coupling constant, 
g,g' are the coupling constants for the SU(2) and U(1) groups. T4, t* are 
matrices corresponding to the generators of SU(3) and SU(2), respectively, 
which are in the representations to which the field y belongs. Y is the U(1) 
charge of the field x; it is called the weak hypercharge. The matter field 
content of the theory is given in the table shown below. The fields | denote the 
lepton fields, Q’s the quarks, and ¢ is the Higgs scalar field. Notice that the 
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same pattern of fermionic fields repeats itself three times; these are referred 
to as the three generations, usually called the electron generation, the muon 
generation, and the tau generation. We could combine the notation by writing 
l*, Q*, etc., where i = 1, 2,3, label the three generations. 

In the table, we have denoted the representations by their dimensions. 
For the singlet representations, T“ and t® are zero. For the quarks which are 
triplets under the QCD SU(3) group, we may take T4 = M4, Ase), cca 
with 


010 0 -i 0 
a) ey 0) a0 0 
00 0 0 0 0 
cea 00 -4 
\y, Seon |, otal 0 
100 r O).0 
000 G0" 6 
Kee |e Lp Om as (12.138) 
01 0 + © 
1 © ie @ 
ie 200i eS =o 1 0 
0 0 0 3 \ Quoin 


Matter fields in the standard model 


SUS) SU (2) U(1) 
Field x representation|representation|charge 


1 2 
1 i 


a.=(3)( 


Urn =urR, Cr, tr 


Dr=drp, 8p, OR 
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The T4 obey the commutation rules [74,77] = if48°T°, where the struc- 
ture constants {48° may be worked out from the explicit choice of matrices 
given above. For the doublets of SU(2), we can write t? = aT, where T°, 
a = 1,2,3, are the Pauli matrices and we have [t®, t°] = ie*°¢t®. The electric 
charge of the field is given by Q = t? + sY. Given these matter representa- 


tions, we notice that the transformation 


Son “ece 
v4 
g = exp (iFr2v3) exp (i2nt?) exp (iY) (12.139) 


acts as identity. All the fields aré invariant under this. g and its multiples form 
a discrete subgroup C of SU(3) x SU(2) x U(1); and thus, strictly speaking, 
the gauge group of the standard model is 


be SU(3) x SU(2) x U(1) 


G ; 
CG Ne 140) 
The relevant field-strength tensors for the gauge fields are 
FA, = O, AP — OA + ef APC AB AS 
G4, = Oyb% — 0,b2 + ged? bf 
Suv = Oper — Ovep (12.141) 


The action for the theory can now be written as 
ge [es Ly Ly + Lyon Lol 
il i a apv 1 Vv 
La = att ae se owe BY qh? 
Gp = aii" (a, ~ iegn? Ay — 1g — Ley) OF 
vi iv 1 a Q AajA 29° t 
+Upiy aaah A, i en Up 
es t rt 
+Diiy" (a, tet eee Zen) Di, 
—s U . —_— iF . 
+iy" (a, — igt"b) Fo] i tlpiy' (Ou + ig’cy)lig 
Lyuk = —fijlid Uh — F4Q,b UR — (2016 Di, + hee. 
. papa ao - t vu? 
jig | 8, ~ igt*be, ~ ie, a| -A(¢te- > (12.142) 
Here ¢ = it ¢* and h.c. stands for the hermitian conjugate. ¢ transforms 


under SU(2) exactly as ¢ does, but has the opposite U(1) charge. It, stands 
for the three charged leptons fields er, Rr, TR. We have neglected neutrino 
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masses, which are very small, and the right-handed neutrinos. The right chi- 
rality neutrinos and neutrino masses can be included by use of the so-called 
Majorana-type coupling and Majorana masses. We will not discuss these. 

We now consider the simplification of the SU(2) x U(1)-part of the theory. 
The scalar field action has a potential energy term which gives a nonzero 
vacuum expectation value to ¢°. In other words, 


1. 70 
0|¢|0) = — 12.143 
(lalo) = = (¢ ) (12.148) 
This vacuum expectation value breaks the SU(2) x U(1) symmetry sponta- 
neously; the isotropy group of the expectation value (12.141) is the U(1)em 
corresponding to electromagnetism which is generated by the electric charge 
operator Q. Thus we have the breakdown SU(2) x U(1) — U(1)em. The 
combination ee 

g OL+9 Cp 

g? ae g” 

is the electromagnetic gauge field and remains massless. Here sinOw = 
g'/V9? + 9'%; 9w is known as the Weinberg angle. The orthogonal combi- 
nation 


A, = = sin Owb? + cosOwe, (12.144) 


g be, =g'c Cr 
/g? + g!? 


and the two fields W* = (b}, + ib2)/./2 become massive. 

One can andexstand the inal content and the nature of the interactions 
by exapnding around the vacuum expectation value given in (12.143). For the 
Higgs field, a general parametrization is given by 


=U(0) = ( ei ») (12.146) 


where U(C) = exp(it?¢?/2) is an element of SU(2), ¢*(x) are the would-be 
Goldstone bosons. When this is substituted into the Lagrangian, we see that, 
by virtue of the gauge invariance, we can absorb the fields ¢ into the gauge 
potentials; this is the transformation to the unitary gauge. In this gauge, the 
gauge and Higgs terms of the electroweak part of the Lagrangian simplify as 
follows: 


A= = cos Ow ba — sin Owe, (12.145) 


1 
£ = 1 (0,Ay — Ay)? — (8.20 — 2.2.) ~ 513.W. — 8.0? 


+Mi,Wi we + = 5M3z, Ge 5 (an)? ~ 5 an + Lint 
WwW 
(12.147) 
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Ligas= SAW. — ALW,|? — © cos 6w|Z,.WL — Z,W,.|? 
~ieA" [W-"(0,Wt - O,Wt) -Wt’(a,.W, -a,W,)| 
—ig cosOw Z" [W-”(0,Wt - Wt) -Wt(d,W, — dW, )] 
+ieWt*W-* (0, AL — 0, A,) + ig cos OwWTEW’ (OnZ, — OZ) 
—eg cos Ow Z" [(A,W, — AW )Wt’ + (A,Wt - AW )W-*) 


a M2 
voy ia —WiWw,)? + te WT*Wrn + sage ate 
M2 M2 M2 M2 
a a Z*Z un + _ Mz pu 79) - r, 7"? = oar n* (12.148) 


Here e = gsin Ow is the value of the electric charge of the waganny The masses 
of the gauge Sasene, to this lowest tree-level order, are Mz = 31/9? + 9 v 
and Mw = $9 v. Experimentally, Mz ~ 91 GeV and My ~ “80 GeV and 
sin? @w = 0.223. The electroweak scale v ~ 246 GeV. These values are 
obtained from experiments. The mass of the Higgs scalar particle represented 
by the field 7 is My = V2A v. There is no numerical prediction for this mass 
yet, since \ has to be experimentally determined. 

The Yukawa interactions Ly,, give fermions masses because the vacuum 
value of ¢ is nonzero. These are generally mass matrices because the coupling 
constants f;; are not necessarily diagonal. Consider how this happens in the 
quark sector. The mass term which arises may be written as 

Lmase = —MYUiUL — MED; iD + he. 

Mi = fie M¢ = f¢ z (12.149) 
U,, Dr are the chiral components of up and down quark fields, respectively. 
The physical particles are mass eigenstates, so we have to diagonalize the mass 
matrices M“, M4. We will take these matrices to be of nonzero determinant. 
If the determinant is zero, that means there is a zero mass field; we can then 
separate it and apply the following argument to the rest of the matrix. Now 
an arbitrary complex matrix M of nonzero determinant can be decomposed 
as M = H U, where H is hermitian and U is unitary. We diagonalize H 
by a paeey matrix S as H = St MaiagS. This shows that we can write 
as M = St Maaeou) = st Maiag’, where S and T' are unitary matrices. 


Thus M can be diagonalized by a biunitary transformation. We can use this 
diagonalization to write the mass term as 


Lemos = OLS" M'T"Un tno ie Dee ae: (12.150) 


where M* and M? are diagonal matrices. Now we redefine the quark fields 
as 


Up Te Dar; 
Ur — S“Uz, Di, + S*D, (12.151) 
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so that 
| nee =U,M"*UR + DirM'*Dp + he. (12.152) 
The redefinition of the quark fields makes the mass term diagonal, but 
changes the interaction terms with the W,,. 
DiyUr > Di Sts y,ULr 
Or yD > ULS*S*y, Dr (12.153) 


Thus the interaction terms with the W-bosons is 
a oe Nea WN p= = 
jo i von +W eye) 
J, = ViGUi Dt, 
Jt = ViD WL (12.154) 
where V;; = (S“S*T);; is called the Cabibbo-Kobayashi-Maskawa (CKM) 


matrix. It gives the mixing angles for the weak interactions of quarks. 
The electromagnetic current is given by 
ar z3 1 zs 
J, = 3 (Ury,Ut + UrypUR) - 3 (Dry,Dr + DryyDr) (129055) 
This is still diagonal after the redefinition of fields as in (12.151). Likewise 
the current 


la - 
yo : (OryUr — DryyDt) (12.156) 


is not affected. Further the Higgs-fermion interaction terms are proportional 
to the mass matrices and get diagonalized when the matrices are diagonalized, 
ensuring that the Higgs will not mediate flavor changing processes, in a one- 
Higgs model such as the one we are considering here. 

Collecting these results together, the quark part of the theory becomes 


ouark = O*(iy 6 M?)U* 76 D'(iy Vb M?)D* 
lai meuin = bi MsDin +eA¥ jem 
v : v i 


aie 


at See 20 
oe Wt], +W 4st) (12.157) 


( 
V2 
The neutral current J, om is given by 


i= sn oy (12.158) 


For the currents in the lepton sector, we have similar expressions. Neutrino 
masses are taken into account by including right-handed neutrinos; the left- 
and right-handed neutrinos do not have the same mass, so that one needs to 
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use so-called Majorana masses. If we neglect the neutrino masses, the lepton 
terms become 


Liepton = Niiy- ON: + EX(iy- 0-— M$)E* + Lint (12.159) 
where N*, for i = 1, 2,3 stands for the electron-neutrino, the muon-neutrino, 
and the tau-neutrino; likewise, E* denotes e, and 7 for i = 1,2,3. The 
interactions of the leptons are given by 


aa eAbigem 4 8 gaye 9 pete 7— eg 
Lint = eA Ji {agar Jn ae Jaw pb Se ia) 


~= 8 MSE 
oy a NiywEy 
Jt = EyyNi , (12.160) 
Ie — —E"y,E" 
Lies i _ Ff i : 
ae = 3 (NiwNy — Lue) = sin? bw J” 


The CKM matrix V is an n x n unitary matrix for n generations. But 
it does not have n? physical parameters in it, since some of the angles can 
still be absorbed into the definition of the fields. For example, we can write 
V = UdiagVU, diag» Where U,U" are diagonal unitary matrices; they are just 
phases of the form e’” along the diagonal. There are n such phases in U, n—1 
phases in U’, avoiding double counting of an overall common phase. U and U’ 
can be absorbed into the definition of Ur and D,; the currents Je and Jem 
are not affected by this, since U,U’ are diagonal. The mass terms change, 
but we can simultaneously redefine Ug and Dr, so that the mass terms 
are also not affected. This shows that we can get rid of 2n — 1 parameters 
leaving n* — (2n — 1) = (n— 1)? physical parameters in the CKM matrix. 
For three generations, we thus have four parameters, corresponding to three 
real rotation angles and one remaining phase. V is not real in this case. The 
fact that V is not real leads to CP-violation due to weak interactions. The 
CP-transformation of a fermion field is defined as 


p(x) =C p*(—2,2°) (12.161) 


p denotes the C’P image of 7. From the invariance of the Dirac Lagrangian 
wiy - (O — igb*t*) we see that the transformation of the gauge fields is 


—bj (—a, 2°) = bo, bF (—wx, 2°) = b; (12.162) 


where T’ denotes the transpose of b as a matrix. In terms of the components, 
this becomes 


bj = —63(—2) b} = b}(—a) 
We = —W, (-2) wt = W; (—2) 
Wo =—W, (-2) W; = W;*(-z) (12.163) 
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For the charged current interactions we then find 
~ —— SG 4 it a 
pws, = jw - Ean + Bry 
/ Wo / we Wendt + Oi ydIV5| (12.164) 


It is then easy to show that the interaction part of the action obeys the 
transformation rule 

Sint(N, E, U, D, Viz) = Sine(N, E, 0, D, V3) (12.165) 
We have CP invariance if V = V*. For three generations of quarks, not all 
matrix elements V;; are real and we have C’P violation. 

Finally, notice that the presence of the gauge interactions and the Yukawa 
couplings (which lead to quark masses as well) show clearly the explicit break- 
ing of chiral symmetry for the quark sector discussed earlier. From the point 
of view of the electroweak gauge symmetry, the spontaneous breakdown of 
chiral symmetry in QCD will break the electroweak gauge symmetry and 
this gives additional masses to the W, Z particles. But the energy scale of 
chiral symmetry breaking is so small compared to the electroweak scale that 
this effect is negligible compared to the masses due to the Higgs vacuum 
expectation value. 
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13.1 Introduction 


Consider a classical field theory with a symmetry group G which may be 
partially a gauge symmetry and partially a global symmetry. In defining the 
quantum theory, one has to evaluate loop diagrams, some of which may be 
divergent, and therefore the quantum theory has to be defined with the help of 
a regulator. There are situations for which there exists no regulator preserving 
all the symmetries. Symmetries of the classical theory which are broken by 
quantum corrections (by choice of regulators) are said to be anomalous. The 
corresponding currents have nonzero divergences which are called anomalies. 

This point about regulators and symmetries can be illustrated by some 
examples. Dimensional regularization has the great virtue that it preserves 
Lorentz and vector gauge-invariance (or the corresponding BRST-invariance). 
But 7° = (1/4! )epvagy4y’y°7" has no natural extension to arbitrary dimen- 
sions (# 4) since it uses the e-tensor. Thus chiral symmetries are potentially 
anomalous if we use dimensional regularization. Another common regulator 
is the Pauli- Villars regulator, where we add very massive unphysical particles 
with negative Hilbert space norm, the mass M serving as the regularization 
parameter. But a mass term like Ma is not invariant under chiral transfor- 
mations and so chiral symmetries are potentially anomalous. 

Anomalies are of two types, anomalous global symmetries and anomalous 
gauge symmetries. In the case of anomalous global symmetries, the symmetry 
is not realized in the quantum theory, but otherwise the theory is consistent. 
Examples are the axial U,4(1) anomalies in QED and QCD. If such a symme- 
try is spontaneously broken, we have a Goldstone boson classically. Quantum 
theoretically, since there is no symmetry, there is no Goldstone boson. In fact 
the quantum corrections generate a mass for the potential Goldstone boson. 

Anomalies for a gauge symmetry can lead to unphysical results. The 
gauge-invariance (or the related BRST-invariance) of a theory is crucial for 
the proof of unitarity of the S-matrix or unitarity of time-evolution in general. 
Gauge-invariance removes the unphysical polarizations of the gauge particles; 
if gauge-invariance is lost, they can become propagating modes and ruin uni- 
tarity. Thus in a consistent physical theory there should be no anomaly for 
the gauge symmetries. There can be an anomaly for gauge symmetry in a 
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subsector of the theory, but all the individual contributions to the anomaly 
must cancel out in the end. 


13.2 Computation of anomalies 


We now turn to the computation of the anomalies, using the Euclidean field 
theory for most of the calculations. In four dimensions, the anomalies are due 
to fermion one-loop diagrams only. Consider the Euclidean action 


ee / Few ai in) Dae Rive 
= / “a oe F24+7-(04V+4+ Ayu] (13.1) 
fs 


where br, = 3(1+75)), vr = 4(1—7°)d, and V = 3(L+R), A=4(L—R). 
We consider N species of fermions with the chiral symmetry U(N), xU(N)p. 
We have taken all the chiral symmetries of the fermion to be gauge symmetry 
so as to be very general. This means that we can write L, = —iT4E4, 
R= ~iT*R4, TA being generators of U(N). One can specialize the results 
to any subgroup of this maximal symmetry by setting some of the gauge 
fields to be zero. 

Notice that we do not write a fermion mass term because we are gauging 
the axial symmetries as well; there is no gauge-invariant mass term if we 
including the axial symmetries as well as vector symmetries. When we restrict 
to a subgroup, we may have the possibility of gauge-invariant mass terms, 
but the calculation of the anomalies is not affected by the mass term, so we 
do not need to consider it. This will become clear later. The field-strength 
tensors are defined by 


FA, = O,L4 — 0, LA + fAPCLB Le 
Fi = O,R4 — 0, RA + fABPORBRO 
Foy = OVA — OVE BP POPE VES + 7 Pe A (13.2) 
Fay = O,AP — 0, Ab + fABO (VB AC — VB AC) 
= (Dy Ay — Dy Ay)* 


where D is the covariant derivative with respect to the vector gauge fields 
only. 


One can add to the action (13.1) a term of the form 
Ee v 
Steg = — )_ / Tr (Fe a. (13.3) 
L,R 


for both the vector and axial vector gauge fields. This term, because of the 
higher derivatives, makes the gauge boson propagator take the form 
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A? A? 
Gw p+ Ap ~ oA (13.4) 
for large-momenta p. This can regulate the Feynman diagrams which contain 
gauge boson propagators, and since (13.3) is gauge-invariant, this provides 
a gauge-invariant regulator for all potentially divergent diagrams except the 
one-loop diagrams. A plays the role of a gauge-invariant cutoff. One-loop 
diagrams can involve the fermion loop, which is not regulated by (13.3). 
Also (13.3) introduces new vertices which can lead to new divergent one-loop 
graphs which need separate regularization. So we cannot make conclusions 
about the one-loop graphs. Since the gauge symmetry covers all the possi- 
ble continuous symmetries, because we have gauged all the symmetries, this 
shows that the anomalies are confined to one-loop diagrams. By power count- 
ing, one can then see that the anomalies are in one-loop fermion diagrams. 
(Higher loops can renormalize the parameters appearing in the expression for 
the anomaly.) 
The potentially divergent graphs which can contribute to the anomaly are 
shown below. 


Fig 13.1. Potentially divergent diagrams for the anomaly calculation 


In these, the external lines can be either vector (V) or axial vector (A). The 
two-point function is the vacuum polarization graph, which we have evalu- 
ated in a gauge-invariant fashion and leads to no anomaly. For the triangle 
diagram, the VV V-graph is zero by charge conjugation invariance (Furry’s 
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i dn Py (O+V + / i'n py (8-V+7AWe (13.5) 


where w° denote the charge-conjugate fields. Thus the possibly anomalous 
graphs are the AVV and AAA graphs. The contribution to the effective 
action from the AVV type of graphs is 


r®) = / d‘cd‘yd‘z Tr [S(2,y)7°7- AW)S(y,2)7- V(2)S(z,2)7- V(@)] 


2 i dys(q,P1,Pa) Str[Ay(2q)Va(pi)Va(P2)] J?" (q,p1,p2) (13.6) 


where 


d*(2q) d*p: d*po (4) 


du(q, Pi, p2) = =a je ee 4 Ei (2q — Pi — p2) 
* vy 1 a i 
EMEP aus oa (k—q)" cirri ac 
(13.7) 


Here r = 4(p1 — p2), and in Str[A,,(2q)Va(p1)Va(p2)] we are taking the sym- 
metrized trace over the generators of the group, i.e., 


Str[AVV] = sik [AVV + VAV + VV 4] (13.8) 


The integral Z#°°(q,p1,p2) in (13.7) has superficial linear divergence. The 
infinitesimal gauge transformations are 


Va 7 Va + Do 
Aq — Ag + Day . (13.9) 
which reads in momentum space as 
Va(p) + Va(p) + tpa6(p) + -- 
Ao(p) > Aa(p) + ipap(p) + +> (13.10) 
The variation of the term corresponding to the triangle graph in the effective 


action under the vector gauge transformation is then given as 


oor) = = [ du(a,p.,p2) Str[A,,(2¢)6(p1)Vo.(p2)| FY“ (q, pi, p2) 


d*k 1 1 
Lo aay (eerie SS re x 


uf Ban iL fe 
a ee —__—_— Sete 
Then) a(eeae? | OSD 
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where we have used p; = (k+q)—(k-—r) and p2 = (k—r) —(k—q). Carrying 
out the traces and writing k’ = k + pj, we find 


d*k [(k—a)(k—T)¢ 
fue =-4 pvapB uw = (fs k’ 12 
(4, P1, 2) € (2r)4 Ee — q)2(k — 7)? ( iad ) ( ) 
Thus, if we could shift the variable of integration to k’ in the second term, 
F¥“(q,p1,p2) would be zero, giving gauge-invariance of 3) for the vector 
gauge transformations. But the integral in (13.12) is linearly divergent and 
the shift can produce a nonzero surface term since 


d*k d*k Of 
/ (27)4 [f(k = a) = £(k)] = ay (27)4 OkY 
: fa, k’k? dn®) 13.13) 
~ nj —— 
(The higher terms are zero as k — oo.) Using this result, 
(3) 
LO — AcbvaB (k— qQ)v (k— r)e 3p 7 oe dQ 
FH (q, pi, P2) = 4e (k — 9)2(k — ry) k (Qn (2n)4] 
dQe os ” 
= 4tvoB (On (nyt ky (a@—1)e piak* 
Our Vol(S*) 
— AchvaB ({ SvA 
a ( 4 ) po P2B TT Gad 
ay 
= are’ Pw Pap (13.14) 
The result (13.11) can now be simplified as 
5or'® = =~ / d*a et¥°8 Tr(A,,0,0 OaVz) (13.15) 


where we have transformed back to coordinate space. 
Under an axial transformation we find 


bor = — / dyi(q, Pi, Pa) Str[p(2q)Va(p1)Va (pa) (4, Pi, Pa) 
d*k 1 1 

OB 2, 
He" (q; Pi, P2) be esa" 1d (k +4) 
1 

a B 

Vy (k=1)! | 

ae: 1 


age Ee a HP 
17 (R-a)) 7 EH” see 
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where in the last step we have used y Qy -q=7-(k- q)yy 4 yy - (k +4). 
Evaluating the trace over the y-matrices and using the symmetry Vy (pi) 
Va(p2), we can simplify this as 


(K+ qQ)ul(k-T)v _ (K+ q)u(k+7)v 
(k+q)(k—-r)? (k+9)*(b+r)? 


(kK-—q)(kK—-Tr)y _ (k-qQ)(k+r)y 
(k—g)*(k—r)? (k—a)*k+r)? 


Seas {Gates = oe ~(k—ok +m} 


at = Dee (ko +m} 


Howe = -2ees| 


(k= ak +r) 
a ao | 40°) [hy (q—1)pwrok? — k(a+7)uPrck? | 
= ge” raya (13.17) 
For 6,I°@) we then find 
§,r® = -=5 / d4z e”°° Ty (9d,,V,0aVa) (13.18) 
Combining (13.15) and (13.18), we have 
or) = as i dta #8 Tr(A,,0,0 OaV~) 
6,03) = = / d*z 4”? Ty (8, V,0aV~) (13.19) 
Both variations are nonzero. However, notice that 


i va 1 U4 
59 | / d*z eb¥oPTy (A,V.00¥9) =e i d‘z H¥*? Tr (A,,0,6 O2Va) 


82 
(13.20) 
Therefore, we define 


ie 1 
FS) — p®@ + ae ij dix: etv8 Ty (AyVLOaV~) (13:23) 
so that 


bol?) = 0 
~ 1 
bpfO = 25 i d4e €”°°Ty (8, V,OnVp) 
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"oF [ ate 204 (oh foo) 
aa =a d*e Tr ( (y Bs oa (13.22) 


where fu» = OnV. — OV, and fay a sehue’ fag. (This is essentially the 
result of Adler and Bell and Jackiw.) The local term (of dimension zero) 
which we added in (13.21) is a counterterm. Redefining the quantum action 
by adding this is equivalent to a change of regularization. Thus we see that, 
for the redefined action °°), the vector gauge-invariance can be preserved 
by a suitable choice of regulator, but not simultaneously the axial gauge- 
invariance. For the U(1) symmetry, one could also choose to preserve the 
axial symmetry, breaking the vector symmetry. For the nonabelian case, this 
is not so, since the axial transformations, by themselves, do not close into a 
group. 

The variation of the action can also be expressed in terms of the conser- 
vation of the current. From the definition e~! = fe-S, 


ar = [ dyX(0,0+7°8,0 (13.23) 
Expressed as conservation laws, equations (13.22) read 
Ou (py"T Ay) = =0 
Oy(Pr PT AW) = sg Te (T4 fav F”) (13.24) 


The result we have derived is for the AVV + VAV + VVA graphs only. 
There is also the AAA-graph. We now specialize to the purely left-handed 
gauge fields and transformations, for which V, = A, = $L,. In this case, 
the AAA-graph is equal to the AV V-graph, so that we can get the full result 
by taking 4/3 times the value we have calculated. For the left-handed gauge 
transformation by parameter ¢, we find, from the formula for V,, A,, that 
6 = $€ and y = 56, so that 


6 C) Bey ae) + aro 
3 1 i 
=6 oa] 
=e = | d*z HYP Ty (¢ 0, Ly OnLg) 
= 5 - i) da ef’ P10 (0,6 Ly OxL8) (13.25) 
TT 


Results (13.19, 13.22, 13.25) are from the triangle diagram only. Corre- 
spondingly we have only analyzed the behavior of I’ under the linearized 
gauge transformations (13.10). For example, for the left-handed gauge fields 
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and transformations, there can be L?-terms which are required for consis- 
tency under the full transformations, including the {42° L?¢°-terms. These 
corrections come from the 4-point fermion loop and from a pentagon dia- 
gram for the case of vector and axial vector transformations. The pentagon 
diagram is actually convergent, but nevertheless a piece of it is needed to 
ensure proper transformation of the anomaly. Rather than compute these di- 
agrams in turn, we can identify what is required from the consistency condi- 
tions. Consider again the left-handed fields. For the full gauge transformation 
a — LA + (DyE)4 = LA + O,€4 + fAPCLBES we define the anomaly G(é) 

Vi s 

6¢F = G(€) (13.26) 


The infinitesimal gauge transformation d¢ can be explicitly written in terms 
of the gauge fields as 


de = / d*x OO @ arr (13.27) 


These are translation operators along the gauge directions of the potentials 
and so they must obey an algebra determined by the group commutation 
rules. In fact, from the definition (13.27) we find 


dg Ogre — Og: Og — Sexer =0 (13.28) 


where (€ x ¢/)4 = fA8CeB¢'C. (If we include vertex functions with mat- 
ter fields in our consideration of terms in I’, there are additional terms in 
the definition of d¢ which carry out the gauge transformation of the matter 
fields. These can be easily included; they do not change the algebra or the 
consistency conditions derived below.) Applying the algebra (13.28) to the 
definition of the anomaly we get the consistency conditions for the anomaly, 
first obtained by Wess and Zumino, 


JeG(E’) — de G(E) — G(E x &') = 0 (13.29) 


Any expression for the anomaly must obey this condition. Having obtained 
the first set of terms by explicit calculation, we can now postulate that G(£) 
has the general form 


i 


= 24772 


if d*x e¥°P Str [0,,€ (LyOaLp + cLyLaLg)| (13.30) 


where c is a constant which can be determined by imposing the Wess-Zumino 
consistency conditions. Carrying this out we find 


a 
b¢eD = G(€) = Oy) [as cvaB oer .e (1.2.20 ata | 


(13831) 
This gives the complete form of the anomaly for left-handed gauge fields. 
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13.3 Anomaly structure: why it cannot be removed 


The WZ consistency conditions give a precise way of stating what we mean by 
an anomaly in a regulator-independent way. Notice that any quantity of the 
form 6¢W for any functional W of the gauge fields will obey the consistency 
conditions simply by virtue of the general algebra (13.28). This does not 
constitute an anomaly. In the first place we cannot add arbitrary functionals 
W to I’, which is basically determined by the theory. However, we do have 
some freedom; if W is the integral of a local monomial of fields and derivatives 
and is of total dimension less than or equal to zero, then the addition of such 
a term is equivalent to a change of regularization. W is then a “counterterm”. 
Thus if G(€) is a solution of (13.29), then G(€) + d¢W is also an acceptable 
solution for any W of dimension < 0. This would simply give the form of 
the anomaly in a different regularization scheme. If every solution to (13.29) 
is of the form 6¢W, then all the anomaly can be removed by a choice of 
regulator, which amounts to defining the effective action as f = [—W. Soa 
true anomaly exists in the theory only if there is a solution to (13.29) which 
cannot be written as d¢eW for some W which is the integral of a local term, 
the total dimension being < 0, i.e., 


deG(E") — de G(€) — GE x &’) = 0 
G(E) # OgW (13.32) 


By writing out all possible monomials of fields and derivatives of dimension 
< 4 (which correspond to terms in W with dimension < 0), we can check that 
the solution (13.31) satisfies this requirement. This shows that the anomaly 
(13.31) cannot be removed by any choice of regulator. Later, in Chapter 17, 
we shall see a topological rephrasing of this statement. 

Another important propery of the anomaly is expressed by the Adler- 
Bardeen theorem. This states that the expression for the anomaly is not 
renormalized by higher loop corrections except to the extent of replacing the 
fields and coupling constants by their renormalized values. We will not prove 
this theorem; the discussion of anomalies in terms of the index theorem and 
topological properties will show why such a theorem would hold true. It is 
a useful result, showing that the necessary cancellation of the anomalies for 
gauge transformations only need to be checked at the one-loop level. 

Since the anomaly shifts as G(€) — G(€) + d¢W under a change of regu- 
lator, the form of the anomaly can be changed to some extent by a choice of 
regulators. We have already seen that the vector anomaly can be made zero, 
or either the vector or axial one can be made zero for the U(1) case, but not 
both, by the use of counterterms as in (13.21). This can be very useful. For 
example, if we have a left-handed gauge theory, we would need to cancel the 
anomalies in the gauge transformations, but for the remainder, for which we 
have set the gauge fields to zero, and which are, therefore, global symmetries, 
we can live with the anomaly. There would be no inconsistency. However, we 
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need to have the anomaly in a form which is invariant under the left-handed 
gauge transformations. This is achieved by use of counterterms. In a similar 
way, the anomaly for axial transformations can be expressed in a form which 
is vector gauge-invariant. This form, as mentioned before, involves the square 
and pentagon diagrams, but the final form can be determined by symmetry 
and WZ consistency arguments as * eat 


G(y) = _ pes he Ser 


1 | 
of pra Fan a 19h 4u Faas 
2 
—3(ApAv Fras + ApFVvvaAg + Fy pv AaAg) 
HE Adedads) | (13.33) 


This form was first derived by Bardeen. 
We now consider the group structure of the anomaly. Consider first the 
left-handed anomaly (13.31). Since L, = —iT4L#, € = —iT4€4, we get 


: 1 
6D = — 1 gABC [as Pies i [anc (220,08 ste SO*SLELELS)| 


2472 
(13.34) 
where d4?© = Str(T4T®T°). d42° is the symmetric rank 3 invariant of 
the algebra of the generators of the transformation. For the case of gauge 
transformations, we can calculate d4?© for various fermion representations 
and check the cancellation of the anomalies. 

For right-handed fermions and the corresponding gauge fields, in addition 
to the replacement L,, — R,, the anomaly has a minus sign relative to the 
anomaly for the left-handed fermions. This is because we have V,, = —A, = 
3R, as opposed to V, = A, = 5L, for the left handed case. 

Finally, notice that 6¢I° is imaginary. With our Euclidean calculation, I is 
usually real. This shows that the lack of gauge-invariance is in the imaginary 
part of the effective action. 


13.4 Anomalies in the standard model 


In the standard model, the gluon fields are vectorial in their coupling to 
fermions. As a result, we can choose a regulator for which there are no gauge 
anomalies. Alternatively, the gluons couple equally to the left and right chi- 
ralities; so the anomaly from the left and right sectors cancel out. 

For the electroweak transformations, the couplings are not left-right sym- 
metric; as a result we can have possible anomalies for the gauge transforma- 
tions. These have to cancel out and give zero gauge anomaly for the consis- 
tency of the theory. The relevant anomalies arise from the b®, b? ¢ and c? 
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triangle diagrams. Thus we need to calculate d4?© for each of these; we do 
this here for one generation of quarks and leptons. 


b® graph 


In this case, fields are either singlets for which there is no coupling to gauge 
fields or doublets for which the generators are t* = $77. 


(tel ete) 
gabe — Str ( — ) = 1 ay(r28e) =Q (13.35) 


a. 
Thus all the anomalies of this type are zero. 
b? © graph 


In this case we need 
d¥o {Str(¥7%7") = 75° THY) (13.36) 


where Y is the weak hypercharge matrix for the left-handed fermions. (Since 
the other vertices of the triangle diagram involve the left-handed gauge field 
b,, right-handed fermions do not circulate in the loop; so we only need the 
trace of Y over the left-handed fermions.) For one generation of quarks and 
leptons, we have 


i 1 
Oe Da | eee cs 
ES aS SNe SS 
UL dr, 
=0 (13.37) 


The extra multiplicity of three for the quarks is due to the number of colors 
being three, i.e., because they are triplets under the QCD group SU(3). This 
anomaly cancels between quarks and leptons. 

c® graph 


The c? anomaly is given by Tr(Y*) and for this, both left and right fermions 
can contribute. We get 


i 1 
Tr(Y?) = |(-1) + (-1) + Rs | <2 
VE en 
UL dr 
64 8 
= == 3 
So a 
eR 
UR dr 


II 
SN 
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eee’, 
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(13.38) 
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In this case, the cancellation involves quarks and leptons and both chiralities. 

So far we have considered anomalies in gauge invariance. But there are 
also anomalies in the standard model for continuous global symmetries. Two 
global symmetries of interest correspond to baryon number B and lepton 
number L and these are both anomalous. Only (B — L) is anomaly-free. 
These anomalies have no bearing on the consistency. of the theory unless we 
try to make these into gauge symmetries, in which case only the combination 
(B — L) can be consistently gauged. 

Consider again one generation of quarks and leptons, say the electron 
generation. Lepton number corresponds to the transformation 


Vv F V nr 
ye ATi 


O30 (13.39) 


The leptons v, e have lepton number equal to 1, while the quarks have no 
lepton number. Baryon number corresponds to the transformation 


v— DV, e—e 
U iB /3 6) 
—e 
Gee iG) 
ur —e/3 up, dr — e*/3 dp (13.40) 


The leptons have zero baryon number, while the quarks carry a baryon num- 
ber of z. 

The anomalies arise from triangle diagrams with either baryon or lepton 
number at one vertex and b? or c? at the other vertices. (Other diagrams are 
important for the covariant completion of the expression for the anomaly.) 
The variation of the effective quantum action is then worked out from our 
general formulae as 


aie -i f ae a(z) ald] + cole] go) 


VL,EL VL eL ER 


-i f ae B(a) call ae scale + cal - = eal = cold 


UL dy UR dr 


= ~i f ax (ale) +6(2) (cts 2 cle) (13.41) 


Ura 


where 


nia 1 
calb] = gppe® Tr Fuv(b)Faa(b) = —a 


LVaB ra a 
ae. Fav Gap 


13.4 Anomalies in the standard model 293 


1 
C2 [e] = 32n2 uO F fag 
Ghiv = Onb§ — O,b5 + €9°b2,b6 
Fuv = Oper - OvCy (13.42) 


in the notation for the field strength tensors used in (12.141). (cq is the second 
Chern class and is related to the instanton number as we shall see in Chapter 
16.) In equation (13.41), for the quarks, there is degeneracy factor of 3 due 
to the QCD group SU(3), which cancels the factor of i due to the baryon 
number. From (13.41) we see that both baryon number and lepton number 
are anomalous, but the combination (B — L) corresponding to 6 = —a is 
anomaly-free. 

Baryon and lepton numbers are vectorial symmetries. It is thus possible 
to choose the regulator such that they are not anomalous. However, for the 
electroweak theory, we have to choose a regulator which makes the (nonvec- 
torial) gauge symmetries anomaly-free; for this regulator baryon and lepton 
numbers must be anomalous. 

We can define a lepton number current and a baryon number current by 


JM =1,y4l, + Eryier 
JBY = ayut dyd (13.43) 


The fact that the effective action is not invariant under the transformations 
corresponding to these symmetries can be expressed as the (lack of ) conser- 


vation equations 
O,J™" = 0, J?" = cold] — 2 cole (13.44) 


(It is to facilitate this identification that we kept a, @ in (13.41) as arbitrary 
x-dependent functions, even though the symmetry is global.) We see directly 
from (13.44) that there is baryon number violation (and lepton number vi- 
olation) in the standard model, if there are field configurations for which 
the integrals of c2{b] or cg[c] are nonzero. For the SU(2) gauge field b, such 
configurations exist; examples are the weak instantons. 

The gluon field does not contribute to these anomalies, but it does con- 
tribute to the axial U(1) anomaly in QCD. The axial U(1) transformation is 
given by 


,—e Gi, Qr=e*Qr 
g = e&'Q (13.45) 


This is not a perfect symmetry of the theory, even without anomalies, due 
to quark masses and other weak interaction effects. However, the strongest 
breaking of this symmetry is due to anomalies and so it is interesting to 
write this down as well. In this case, the computation is a straightforward 
generalization of (13.22) and gives 


294 13 Anomalies I 
6,I = -i an; f ded = pN wav Tr (Fu Fas) 


= i 2Ny | d*z  p{A] : (13.46) 


plA] = apace? Tr (Fur Fag) 


3272 


where A is the SU(3)-gauge field corresponding to the QCD gauge group. Ny 
is the number of light fermion flavors. (For the heavier fermion flavors, the 
breaking of the axial symmetry due to the fermion mass terms is so strong 
that separating out the anomaly contribution is not particularly useful.) The 
quantity p[A] is the density for the instanton number; the integral of [A] 
is the instanton number which is an integer. The change in the action for 
A = 2nn/(2Ny¢), where n is an integer, is given by AI’ = 2zin, so that 
exp(—I°) is invariant. We then see that, although the U,4(1) symmetry is 
broken by the anomaly, we still have a Zan, symmetry for all the physical 
results. 


13.5 The Lagrangian for 7° decay 


We have calculated the lifetime of the 7°-meson in Chapter 7 using the ef- 
fective Lagrangian term 


Ca = pr nll B ‘a : : (13.47) 


(In Chapter 7, we used the letter ¢ for the pion field, here we use 7° in 
agreement with Chapter 12.) The field 7° can be understood as a Goldstone 
boson arising from the spontaneous chiral symmetry breaking in the standard 
model. We can combine this piece of information with our calculation of the 
axial anomaly to derive the Lagrangian (13.47). 

The pion is made up of the up and down quarks only and so for this cal- 
culation we need just those two flavors of quarks. Consider a chiral symmetry 
transformation 


u — exp(i7s5y) u, d—exp(—iysy)d - (13.48) 


This may be combined as 


i ee — exp(i7sTsy) q (13.49) 


The field U representing the pseudoscalar mesons behaves like gzgr under 
chiral transformations, transforming as 


U =U’ = 91 U gi, (13.50) 


* 
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Comparing these with the expansion of U in terms of the pion field given 
in (12.90) and (12.92), we see that, when the quark fields transform as in 
(13.48), the pion field changes as 

nm + 7° + Qfpp (13.51) 


The up and down quarks have electrical charges Ze and —ie, respectively, 
and there are three colors of each which contribute to the term in the anomaly 
with the electromagnetic field. Keeping these in mind, we find, from (13.22), 


e oad ie 
Oot = gee pes (od pred OF (2) = (3) xs 
a or 2 
= iS / dé oF Faw Ew : (13.52) 


This integrates to give the effective interaction term 


- a. - 
/ ip — ar fy [as id Pe nye 
a gee. | gies - Bae 
are ds E-Br (13.53) 


This calculation has been done in Euclidean space; exp(—I°) continues to 
Minkowski space as exp (i f an with Line given by (13.47). 


13.6 The axial U(1) problem 


We have seen that the approximate chiral symmetry SU(3), x SU(3)p for 
the three light flavors of quarks is spontaneously broken due to the nonper- 
turbative effects of the gluon fields. If the chiral symmetry were a perfect 
symmetry of the Lagrangian, this spontaneous breaking would lead to mass- 
less Goldstone bosons. Since the starting Lagrangian does not have the full 
chiral symmetry due to mass effects, weak interactions, etc., the potential 
Goldstons bosons become massive, but remain light compared to the scale 
of the strong interactions. They are pseudo-Goldstone bosons. The effective 
Lagrangian we wrote down in equations (12.91, 12.96) capture these features 
of the mesons. There is, however, one puzzle in this regard, whose solution 
requires the axial U(1) anomaly; this is the so-called U,4(1) problem. Since 
U(3), x U(3)pR is broken down to U(3)y, we expect nine Goldstone bosons. 
The candidate meson corresponding to the U4(1) generator is the 7’ meson 
which has a mass of approximately 958 MeV, which is significantly higher 
than the masses of the octet of pseudoscalar mesons. This is puzzling, but 
not a contradiction yet, since all these mesons get masses due to the fact that 
we have an imperfect starting symmetry. However, one can do a calculation 
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of the mass of the 7 allowing for quark masses and this leads to the result 
My < V3 m,. This inequality is badly violated in nature, this is the U,(1) 
problem. Clearly something is wrong about assuming that the violation of 
axial U(1) symmetry is not stronger than the scale of quark masses. 

The anomaly of the U4(1) symmetry provides the answer. Even though 
the classical Lagrangian does not show such a lack ef symmetry, it is vio- 
lated at the scale of strong interactions due to the gluon fields. Thus one 
cannot really interpret the 7’ as a psuedo-Goldstone boson. Under an axial 
U(1) transformation, the meson field U transforms as U — exp(2iy)U. The 
variation of the action given in-(13.46) then shows that the effective action 
at the level of the mesons must contain a term 


Soe 5 (log det U — log det Ut) Cae) 
2N 
= f n! plA, x] (13.54) 


(This is now written in Minkowski space.) If the two-point function for the 
instanton density p[A] has the form 


(o(a)p(y)) = mpd (e@—y)+O(0) (13.55) 


then the 7’ gets a mass term 


(13.56) 


1 [2Nymé 
Sry mass — 3 f | n? 


fi 


This would be the extra mass for the 7’ due to the anomaly and can provide 
a solution to the U,(1) problem. This formula for the 7’-mass is known 
as the Veneziano-Witten formula. The relevant two-point function can be 
estimated in some nonperturbative approaches to QCD, such as lattice QCD. 
(For simplicity, we have neglected mixing among the mesons.) 

There is one other interesting point about this analysis. The anomalous 
conservation law for the axial vector current is 


Th = Np = 


= OPT Fy Fog - (13.57) 


The right-hand side is actually a total derivative and can be written as 
—2N;0,K", where 


ag 7) 
a BTy (4.0046 ae AnAnAa) (13.58) 


ee 
Thus, we do have a conserved current J4 + 2N;K". This might seem to 
lead back to the argument that we should have a Goldstone boson. However, 
K» is not gauge-invariant. Of course, what is relevant for this argument is 
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really the question of whether the charge f d°z K® is gauge-invariant. (Notice 
that K° is the Chern-Simons term mentioned in Chapter 10.) Under a gauge 
transformation K° transforms as 


he 
K°(A9) — K°(A) = — zg" 8 Tr(g-*8;9 Ax) 


' 


sgn Tr(g" "0:9 9-99 g~*Okg) (13.59) 
The first term on the right-hand side is a total derivative and so will not 
contribute to the charge. Thus { K ° will be gauge-invariant and lead to a 
Goldstone boson, if the second term does not contribute. For an Abelian 
gauge theory, the second term is zero and hence an anomaly in U,(1) due 
to Abelian gauge fields will not resolve the U4(1) problem. For a nonabelian 
gauge theory, there are gauge transformations for which the integral of the 
second term is not zero; it is the winding number of the transformation, as 
discussed later. Further one cannot restrict attention to only those trans- 
formations for which the integral of the second term is zero, because there 
are field configurations, specifically the instantons, for which such transfor- 
mations are needed to represent them in a nonsingular way. Thus with a 
nonabelian symmetry, and the associated instantons, we do have a resolution 
of the U4(1) problem. 
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14 Elements of differential geometry 


14.1 Manifolds, vector fields, and forms 


This chapter is meant as an elementary introduction to the ideas of differ- 
ential geometry. The emphasis will be on ideas and computations. Precise 
definitions and rigorous formulation of these ideas can be found in many 
books on mathematics. 

We will assume some basics of open sets, topology, topological spaces, 
continuity, differentiability, homeomorphism as well as some understanding 
of analysis on R”. 


Topological manifold 


An n-dimensional topological manifold M is a topological space such that 
every point p € M has a neighborhood which is homeomorphic to an open 
disc in R”. 

Recall that a homeomorphism is a continuous, one-to-one (and hence 
invertible) map. The above definition means that we can consider open sets 
U; such that M = U, U;, with the collection {U;} forming an open cover of 
M. Then 

yy : UD” 


where D” is an open disc in R”, assigns local coordinates to M. The real 
numbers corresponding to the points of D” as assigned by the map yy are 
the local coordinates. y~! exists and is continuous. Each such map ¢ is called 
a chart. 

Consider U and V, which are open sets of M, with UNV # @. We 
then have yy : U — D", py : V — D”. On the intersection, we have two 
possible choices of coordinates. One can make a transformation from one set 
of coordinates, say, as assigned by yy to the other, those assigned by yy by 
using the function yy wig This is called a transition function. The transition 
functions are continuous. 

A manifold is a differentiable manifold if all the transition functions are 
C@; i.e., they have well defined derivatives to arbitrarily high order. Most of 
the manifolds that we normally encounter, such as R”, the n-sphere S”, etc., 
are examples of differentiable manifolds. One can choose different charts for 
coordinatizing a manifold. A mapping of one set of coordinates to another is 
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a coordinate transformation. A C® transformation on M is referred to as a 
diffeomorphism. 

The one-dimensional sphere $1 (a circle) is one of the simplest examples 
of a differentiable manifold which will illustrate some of the ideas presented 
here. In this case, we can use the angle 9, which is a real number, as the local 
coordinate. However, we see immediately that we cannot cover S by a single 
chart. 6 = 0 and 6 = 2m correspond to the same point on the manifold S?. 
Since we want one-to-one map to a neighborhood in R!, this is not acceptable. 
We have to use at least two charts to provide a proper coordinatization of 
oe 
Vector fields on M 


We now consider curves on M. These are maps U; : t € [0,1] > M. The 
image of the interval [0,1] is a curve on M with the initial point p € M 
corresponding to the value t = 0. We will denote the points on the curve as 
U;(p) with Uo(p) = p. 

Let F(M) denote the set of all differentiable functions on M. Thus if 
f €F(M), f:M—R', and f is C®. We can define a vector X associated 
to the curve U;(p) at the point p as 


(X- Nl) = FFOu)) ~ aay 


t=0 


X is the tangent to the curve U;(p) at the point p. By considering all curves 
with p as the starting point, we get a vector space at p. This is the tangent 
space T,.M and X is an element of this tangent space to M at the point p. By 
considering a set of curves with different points on the manifold as the initial 
points, we get a vector field, defined for all points on M. This is equivalent 
to putting all tangent spaces at all points together to get TM = U, TM, 
which is called the tangent bundle of M. The vector field is thus obtained 
by choosing an element of TM for each p. 

From (14.1), we also see that a vector field X can be thought of as a 
mapping F(M) — F(M), given by f — X - f. In local coordinates, we can 
write U;(p) = x* + t&"(x) +--+, where x’ are the coordinates of the point p. 
(We are actually writing y(U;(p)) here since we are using local coordinates. 
When there is no possibility of confusion, we use the local coordinates without 
explicitly specifying that a chart is involved.) From the formula given above 


X-fe)=FietrWe+-y| =e 25 a2) 
t=0 


Thus the vector field is given by 
i) 
7 


X = €'(z) Fait 


(14.3) 
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in the local coordinates we have used. 0/0x’ define a basis for vector fields 
and €’ are the components of X in this basis. This basis is usually referred to 
as the coordinate basis. €’ are what we usually call the contravariant vectors. 
The definition of a vector given here is independent of the coordinates used, 
being defined entirely geometrically. 

Given a curve U;(p), we can define the vector field at the starting point as 
we have done. A question which may naturally arise is whether we can obtain 
U:(p) given €*. Locally, the answer is yes; this is the theorem of Frobenius. 
The differential equation 

% 
= E(x) (14.4) 
has a solution at least locally, given the initial condition that x*(t)|t=9 corre- 
sponds to the point p. The trajectories so defined are the integral curves of 
the vector field X. 


Differential one—forms on M 


Since T,M is a vector space, we can define a dual vector space TM. We 
denote the basis of the dual space as dz‘ in the local coordinate basis and 
define it by the rule 


a. \ 


(The bracket specifies the rule of associating basis vectors of the two vector 
spaces.) This is often called the interior contraction of 0/0x* on da). TM 
is called the cotangent space at the point p. The union of all such spaces 
gives us the cotangent bundle T*M. A differential one-form is then defined 
by giving an element of this dual vector space at each point of the manifold. 
With dz’ as a basis, a dual vector is of the form w = w;dz*, where w;(z) are 
differentiable functions on each chart U. The interior contraction of a vector 
field X = £°0/0z* on this is given by 


ie = (¢ ae wdc!) = fw; (14.6) 


By requiring this to be a globally defined function on the manifold M (i.e., 
ixw € F{M)) for all vector fields X, we are able to define an intrinsic notion 
of an element of the cotangent space. 

The components of a differential one-form, namely, w; in writing w = 
w,dz*, are covariant vectors. 


Coordinate invariance 


Consider a coordinate transformation from a set «* to another set y’, 
which are specified as functions of the z’s. For a function f € F(M), the 
function in terms of x is given by f(y) where we substitute for y as a function 


of, i.é., f(z) =f w) . This gives 
y=y(x) 
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wail ty) = or (#5) (14.7) 


Vector fields aud one-forms are defined in a coordinate-invariant way. Thus 


= OP =8W) 55 
w = w;(x)dx* = 0;(y)dy?y ~*~ (14.8) 


This tells us how the components transform. Using (14.7), we find, for a 
vector field, 


X= ae =e We) ie 


=b0%; (149) 
which gives 

Bu) =e) (34) (14.10) 
In a similar way 

@;(y) = wi(z) ( a) 3 (14.11) 


It should be emphasized that the vector fields and one-forms are evaluated at 
the same geometrical point in all this; only the coordinates used to describe 
the point have changed. 


Differential k—forms 


Starting from T,M and taking the k-fold tensor product, we get (T,M)*. 
An element of this space will define a tensor of rank k at the point p. The 
collection of such tensors over the entire manifold will give a tensor field of 
rank k. In a similar fashion, we may take the tensor products of the cotangent 
spaces and define dual tensors. 

There is a class of dual tensors which play a special role in differential 
geometry. These are obtained by taking the antisymmetrized tensor products 
of T*M. A differential k-form is then given by an element of the antisym- 
metrized product A,(Z;7M) for all points of the manifold. (The wedge symbol 
denotes the antisymmetrized tensor product.) In local coordinates, a k-form 
w will be given by 

w= - Wixig--i, da" A da? A--- A dat (14.12) 
where the components w;,;,...;, are antisymmetric in the k-indices. 

We can also define an exterior (or outer) product of differential forms by 
taking a wedge or antisymmetrized product. If a is a k-form and # is a p- 
form, the product a A G is a (k + p)-form, which is given in local coordinates 
by 
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] ; 
a= Fi ink dz*! A-+- dat 
il ; 
B= piri dx“ \---dz'? 
11) . : 
an a KE! p! OG. th Dia “tk+p] da" A ls dx**+P (14.13) 


where the square brackets indicate antisymmetrization of the indices enclosed. 

Because of the antisymmetrization, all k-forms on a manifold of dimension 
n vanish for k > n. Thus the differential form of the highest rank is an n-form 
on an n-dimensional manifold. 


Pullback of a k—form 


Consider a map from a manifold M to N, y: M — WN. If x denote the 
coordinates on M, then y(z) are the coordinates of the image of M under 
the map y. Let w be a differential k-form defined on N, which can be written 
in local coordinates y on N as 


se awinies i, (y) dy™ A dy? A+++ A dy™* (14.14) 


We now consider w on the image of the map y. This can be considered as a 
k-form on M as follows. For the y’s, we substitute the map y(z) to obtain 


gw = kl =e in (P(x) dp(a)* A dy(z)? A+++ A d(x)" 


41 tk : 
— aWinianty (y(z)) (3 ) see (55 . dx) ae A daxi* 


Ori Oxisr 


(14.15) 


y*w is a differential k-form on M. It is referred to as the pullback of the 
k-form w on NV by the map ¢. 


The exterior derivative 


The exterior derivative, denoted by d, is a differentiation operation on 
differential forms, which takes k-forms to (k + 1)-forms. In local coordinates 


dw = A Bwixinwis te ded \ dai A dr? A-++ A dx 


kt! Oxd 
1 ; ; ; i 
a ei i ae Nae Noe NOL ee 
(k +1)! 1412°** Tet h+1 
a a 4 
Wirig-tningy => ora Whois -te41 —, Ania Aa iris: rue ~—f (- 1) Ag Fees Ao Wizie---tz 


(14.16) 


Applied to a zero-form (which is a function) and a one-form, this becomes 
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of. 
af = Hae 
diy = om dz) A dz’ 
Ox3 
= 5 (Bij — Bjun) dot A der! (14.17) 


From this, we see that the exterior derivative is the generalization of the 
familiar curl of a vector. 
Two of the most important properties of the exterior derivative are 


ie =) 
d(aA B)=daANB+(-1)¥aAdB ~ (14.18) 


where a is a k-form. The first of these is obvious, since 0,0; — 0;0; = 0 on 
any well-defined quantity. The second is also easily checked from the anti- 
symmetric property of the wedge products. 

It is interesting to see how the antisymmetrization in the definition of the 
exterior derivative is important for the coordinate invariance or invariance 
under diffeomorphisms of M. Consider a one-form w which can be written in 
local coordinates as w = u;dz* = & dy’. For the exterior derivative, we can 
write 


_ [ 00; Oy™ dy' ee O7y? 
~ | dy™ Oxi Axk * Ax xk 


| az na! 


rt dy* A dy’ (14.19) 
We see that the form of the derivative is the same in terms of the y-coordinates 
or x-coordinates. This invariance is possible only because the term involving 
the second derivative of y* cancels out in this expression due to the antisym- 
metrization in j7, k. This shows the importance of the antisymmetrization in 
defining the exterior derivative. 


Integration of forms 


One of the reasons that differential forms are important is that they can 
be integrated over submanifolds of suitable dimension. Consider an n-form 
on an n-dimensional manifold. For the product of the dz’s we can write 


dz A+++ Adgin = eiiavin gna (14.20) 


where ¢*1#2"'» is the purely numerical Levi-Civita symbol which is antisym- 
metric under exchange of any two indices. It has the value zero if any two 


14.1 Manifolds, vector fields, and forms 305 


indices have the same value and e!?""" = 1. If iji9++- in is a permutation P 
of 12---n, e4%2°"* is 1 if P is an even permutation and —1 if P is an odd 
permutation. In (14.20), d"x is the standard integration measure for the n 
real variables, x1, x?,---, x2”. 

If w is an n-form, we can write, locally, 


' 1 : 
aS nl vitia tn AA oes 
: = 3s ; 
= Ake natanetin giitavin de 
sa: 4.21) 


This is of the form of a function p with the integration measure d”z for the 
region in R” which corresponds to the neighborhood on M that we are con- 
sidering. We can thus integrate this by the standard techniques of integration 
over n real variables. We can do this for each neighbourhood of M to get the 
integral of the n-form w over M. In putting together the values for the various 
neighborhoods, we have to use the transition functions, which is equivalent to 
a coordinate transformation. Suppose we use the coordinates x on one patch 
and the coordinates y on a neighboring patch. On the intersection region 


Ox! Oain 


4 tn enone j cee in 
ge A= Nae = TE ae A dy! 
On" Oa"" pus, 
= ook q 9n Tr 
a Oyh OyIn e : d y 
Ox” i 4 

= aet (5) dy" A-++Ady 
= J dy A.» Ady (14.22) 


J is the Jacobian of the transformation. Since w;,;,...;, transforms by the 
inverse Jacobian, the value of w is the same irrespective of which coordinates 
are used, and so we can extend the integration over all of M. 

There is one complication that could arise in this integration. Consider 
the computation of the integral of an n-form which has p(x) = 1 on the 
intersection region of two coordinate patches. In integrating over this region, 
we can use either the x-coordinates or the y-coordinates. We then get the 
coordinate volume as measured by the 2’s or the integral of Jd”y over this 
region. If the Jacobian is not positive, we can get inconsistent results for the 
volume of the region of integration. A manifold for which it is possible to 
choose the transition Jacobians to be positive is said to be orientable. We 
shall be interested in orientable manifolds in what follows. 

If we have a k-form, we can integrate it over a k-dimensional subspace or 
submanifold. Neighborhoods on the submanifold can be mapped to regions 
of R*, by an invertible map. The restriction of the k-form to the submanifold 
can then be pulled back to the region of R* and integrated as an integral 
over real variables. 
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The exterior algebra of forms 


Differential forms of the same degree can be added to get another form 
of the same degree; we can also take arbitrary linear combinations with real 
coefficients. (Generally, the coefficients can be in some field of numbers, but 
we shall use just real numbers in our discussions.) 

The wedge product of two differential forms is again a differential form. 
The exterior derivative of a k-form gives a (k + 1)-form. Under these oper- 
ations, the differential forms on a manifold form a closed algebra, which is 
known as the exterior algebra of M. 

Differential forms, their algebra of exterior (or wedge) products and the 
exterior derivative are somewhat special in terms of analysis on a manifold 
M. We have seen that k-forms can naturally be integrated over k-dimensional 
subspaces. This is one of the main reasons that differential forms are impor- 
tant. We have also seen that the antisymmetrization in the derivative makes 
possible a coordinate independent definition of a derivative. The important 
point is that up to now we have not introduced the notion of a metric or any 
other additional structures on the manifold M. Differential forms and the 
exterior derivative are independent of any such structure. They are thus very 
useful in expressing some of the topological properties of the manifold. 


Exact and closed forms 


If dw = 0, then w is said to be closed. If w = da, then w is said to be 
exact. Obviously, exactness implies closure, since w = da > dw = d?a = 0 
by virtue of d? = 0. The converse is in general not true. The Poincaré lemma 
is the statement that every closed form is locally exact. In other words, if 
dw = 0, then w = da for some a in a local region. In general, this will not 
hold globally. 

For example, on M = S' with the usual angular coordinate 6, the one- 
form dé is evidently closed, but it is not exact, because the zero-form whose 
derivative it is, namely, 9, is not globally defined because of the problem of 
single-valuedness discussed earlier. 6 is well defined on a local neighborhood 
and so Poincaré lemma is also illustrated by this example. By contrast, the 
one-form sin @ dé is also closed and we can write sin@ d@ = d(—cos@); cos8 
is a well defined function on S', and so sin@ d@ is exact as well. 

As another example, consider the 2-form which is given in the usual an- 
gular coordinates as sin@ d@ A dy on M = S?. This is closed but not exact. 


Stokes’ theorem 


If w is a k-form and C is a (k + 1)-dimensional subspace of M 


[ ay = f Ww (14.23) 


where OC is the boundary of C. This result is a generalization of the usual 
Stokes’ theorem of three-dimensional vector analysis. It can be proved in the 
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same way as the usual Stokes’ theorem is proved. If C is a closed manifold, 
i.e., its boundary is null, then f,, dw = 0. 

Stokes’ theorem is very useful in operations involving differential forms. 
One can even use it to-prove the nonexactness of a differential form in some 
cases. For example, consider w = sin@ d0 \ dy on M = S?. By direct evalu- 
ation of the integral 


/ w = Ar (14.24) 
S2 


w is evidently closed, since d@ A dé = 0. If it were exact, say, w = da, then 


one could write 
/ v= | 3 i (14.25) 
s2 as? 


since the boundary of S$? is null. Therefore, the previous result (14.24) shows 
that w cannot be exact. Thus w = sin6@ d@ A dy is a closed, but not exact, 
2-form on S?. 


The volume form and the Hodge dual 


Consider a Riemannian manifold M with a metric or distance function 
given by ds? = g;;dx'dz). The volume element on this space is given in these 
coordinates by /det g d"x. We can write this as an n-form V on M which 
is called the volume form. Explicitly 


1 i Q 
V= 71 V det g Casa) Ne /\ ee (14.26) 


where €;,...;,, is again the purely numerical Levi-Civita symbol. 

If w is a k-form on an n-dimensional manifold M, then we can define an 
associated (n — k)-form, called the Hodge dual of w and denoted by *w, by 
using the volume form. Given w and its local components wy,...;,, we can 
define a contravariant tensor associated with it, using the metric, by 


i Se Mi 6 O 
eae Oe as a WK. 
Tu k! Oxte = Ogi 
aw re) 0 
a ne 11 gi Fis, ie Bote are aaa (14.27) 


g) is the inverse to the metric defined by gg; = 5. Notice that we have 
reversed the order of indices for the 0/0zx’s; this will avoid some unpleasant 
minus signs. The contraction of this tensor with the volume form is the (n—k)- 
form *w. In local coordinates 


*w = cae V) 
1 1 fe oe: 
a (n—k)! —k)! E / det g Gini o f ae aha 


xdxit+t A+. A dan 
(14.28) 
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From this definition we can check immediately that *(*w) = (—1)*(?-*) w 
with Euclidean signature for the metric. (With Minkowski signature when 
det g is negative, we have *(*w) = —(—1)*("-® w, ) 

The definition of the Hodge dual depends on the metric; it is not a metric- 
independent quantity. 

The Hodge dual also gives a notion of an inner product of forms on a 
manifold. Let a and ( be two differential k-forms on'‘M, which we will take 
to have Euclidean signature for the metric. We can then define an inner 
product of these two forms as 


(a, ) = [ane 


1 se ae 

=— { Vdetg aj,..2,5;,..3, 9° oe (14.29) 
Rl 

= (G,a) 


Using this inner product, we can define an adjoint 6 for the exterior derivative 
d by 
(a, dy) = (da, 7) (14.30) 


where a is a k-form and 7¥ is a (k—1)-form. The operator 6 is defined only on 
forms for which the inner product (14.29) is finite. This is the case for compact 
manifolds of finite volume and also for suitably restricted class of forms on 
manifolds of infinite volume. While d is a mapping from k-forms to (k + 1)- 
forms, 6 is a mapping from k-forms to (k — 1)-forms, as is clear from (14.30). 
By using the definition of the Hodge dual we find 6 = (—1)"**+"+1 *d*. In 
local coordinates, 


1 ji i i 
1a = “kD! 0; (9? 3) dz*? A---Adz** (14.31) 
We see that 6 is a generalization of the notion of the divergence of a tensor. 
Another operator of interest related to these is the Laplace operator A 
defined by 
A=d6+6d (14.32) 


The Laplace operator is a mapping from k-forms to k-forms. A differential 
form w for which Aw = 0 is said to be a harmonic form. 


Lie derivative 


Another differential geometric concept which is very useful is the Lie 
derivative. A transformation x‘ — z' + €*(x) on a manifold M can be con- 
sidered as being generated by the vector field € = & — In particular, for a 
function f on M, the change under this transformation is given by 


jefe as =Lef (14.33) 
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This is called the Lie derivative of f with respect to the vector field €. In the 
case of a differential one-form A = A;dz’, there will be two sources of change, 
due to the change of A;(x) and due to the change of dz’ or the change in the 
basis with respect to which the components A; are defined. We thus find 


IgA = oH oo aa’ i+ Ay us da 


= rz] on — 0; A;)dx* + 0;(A;€7)da* 
ied A + d(teA) = (igd+ die) A (14.34) 


Il 


where i¢ denotes the interior contraction of the vector field € with A, which 
was introduced in equation (14.6). Notice that the formula (14.33) for the 
Lie derivative of a function may also be written as (i¢d+ die) f, since ief =0 
anyway. For a differential k-form, by virtue of the antisymmetry of indices, 
ig is defined by 


: F il 
igw = t¢ (Fst “pda? 1 A dat? .--A Ade‘) 


= El (ep aeangedn --->+Adz’* — TEP ere te Adz +-+-Adx** +-- -) 
(14.35) 


With this definition, it is easy to check that for any differential form 


Le = ied + dig (14.36) 
32; being dual to da’, we find 

a 0&3 A 

59 Re ears 

cE gas oa Ox* Ox'* Ox) 
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The Lie derivative is given by the commutator of the two vector fields. 

In summary, we see that the Lie derivative gives the change in any quan- 
tity under the action of a vector field, taking account of the change in the 
frames used to define the components and the change in the components 
themselves. Thus it is a notion of the derivative with a direct physical mean- 
ing. It gives the change of any quantity as we move from a point to an 
infinitesimally nearby point by the action of the vector field. 
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14.2 Geometrical structures on manifolds and gravity 


14.2.1 Riemannian structures and gravity 


We have used the notion of a Riemannian metric on a manifold in a few places 
in our discussion. In this section, metrics and other geometrical structures on 
a manifold will be considered in some more detail." * 

A metric is a measure of distance between nearby points on a manifold. If 
the coordinate separation is dz“, we define the square of the distance between 
the points as 

ds* = gy,dx"dz” (14.38) 


guv is a symmetric covariant tensor; it characterizes the geometry of the 
space. We also require that the metric be nonsingular, i.e., g,,, considered 
as matrix, should have det g # 0. 
Since g,, is a symmetric matrix, it can be brought to the form (BOP) pos 
explicitly 
Guv = dd EY (14.39) 


where a is an index taking values 1 to n on an n-dimensional manifold. E/ 
is invertible since g,, is nonsingular. The quantities E* = Eidr* are a basis 
of one-forms on the manifold and are called the frame fields. Here we are 
taking a metric of Euclidean signature so that g,,, is a positive matrix. For a 
space of Lorentzian signature, the frame fields are defined by gy, = E4n®E? 
where 7% = diag(1, —1, —1, —1); we will continue with Euclidean signature, 
most of the results are easily continued to Lorentzian signature. (Rather than 
starting with the definition of the metric, as we have done here, one could also 
take the frame fields as the fundamental objects and introduce the metric as 
a derived notion.) 

There is an ambiguity in the identification of the frame fields given the 
metric tensor g,,. Notice that E? and R2°E°, where R® is a local rotation 
matrix, give the same metric. The rotation group SO(n) should thus be a 
gauge symmetry in all of our considerations on the geometry of the space. 
Therefore we introduce a gauge potential or connection w® = w®*dzr" for the 
rotation group. This is a one-form taking values in the Lie algebra of SO(n). 
The torsion T® and the Riemann curvature R® are then defined by 


dE* wu" ABST 
diy? Ts A eee (14.40) 


Here dE? +w® A E? is the gauge-covariant exterior derivative of E*. The con- 
nection w, usually known as the spin connection, transforms under rotations 
as Ww? — (uw!) = R%*we4(R-1)4 — dR (R-1)°. The curvature and torsion 
are gauge-covariant two-forms; the torsion transforms under rotations as the 
vector representation of SO(n), while R® takes values in the Lie algebra of 


SO(n). 
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A manifold M endowed with metric and connection as described above 
is said to be a Riemannian manifold if we have the further condition of zero 
torsion, T° = 0. In such cases, the spin connection is determined, up to 
the usual freedom of gauge choice, by the metric. For, if T* = 0, in local 
coordinates we may write 


(O,.E 7+ weE?|] — [d,£2+wE*] =0 (14.41) 
We then introduce Le by 
0, ES + wt? BP —-T, ES =0 (14.42) 


5 ey is symmetric in yz, v by (14.41). From the definition of the metric, keeping 
in mind the antisymmetry of w®° in a,b, we get 


OaGuv — Te, 9x — gua =0 (14.43) 


Writing out this equation with permutations of indices and adding terms 
suitably, we find 


L a 
oe — eld (OMav a Odean a Oat) (14.44) 
We can now go back and write equation (14.42) as 
(et) re (eae (14.45) 


We see that, for a Riemannian manifold, the metric g,,, determines LS since 
it also determines Ei up to gauge rotations, war can be constructed from the 
metric. hae are known as the Christoffel symbols. 

The Riemann curvature of a Riemannian manifold may be simplified, 
using the expression (14.45), as 

Re eee Rive 
ae Hcg OE iy. + Updos Arcade (14.46) 

As mentioned earlier, the frames £% define a basis for one-forms or the 
local cotangent space. One can regard Ei and (E~')?* as transformation 
matrices connecting local coordinate frames and local tangent and cotangent 
frames. One can then use these to transform tensors from a coordinate basis 
to the local tangent frame basis and vice versa. Rs can thus be regarded 
as the curvature tensor in a local coordinate basis. The covariant derivative 
of a vector V® is given by 


(DV)? = 6,V% +w Vv? (14.47) 


Defining the components in the coordinate basis as V, = EiV°, we find 
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The left-hand side of equation (14.43) can thus be identified as the covariant 
derivative of the metric tensor; we see that the metric is covariantly constant 
on a Riemannian manifold. 

The Riemann tensor measures the curvature of space. Consider the par- 
allel transport of a vector V* around a closed curve,C. The change in the 
orientation of the vector for a complete circuit of the curve is given by 


év? =U" Vy? 


> =P exp (f urist) (14.49) 
Cc 
wy 1+ [ Rose 


where S$ are the generators of the rotation group; in the vector representa- 
tion, which is relevant here, they are given by (9% ).q = 526 — 696°. U® is 
the gravitational analog of the parallel transport operator in a gauge theory 
given in (10.32). As in that case, in the last line of equation (14.49), we have 
simplified the integral for a closed curve of small area; o is the area of the 
surface with C as the boundary. Equation (14.49) shows that R®° is indeed 
the curvature of space. 

The Ricci tensor R,. and the Ricci scalar FR are related to the Riemann 
tensor fan. and are obtained from it by contraction of indices. 


Rua = Riva 
= ote =n Ta fe Tile sabaeil ey 


The Ricci tensor R,, is a symmetric tensor, as can be checked from the 
explicit expression for it. 

According to Einstein’s theory of gravity, spacetime is a Riemannian man- 
ifold with a Lorentzian metric of the form g,, = PD a De The metric is 
dynamically determined by matter; the equation which gives the metric of 
spacetime for a given distribution of matter is the Einstein equation, which 
is 1 

Ryuv — 5 IuwR =87GT., -: . (14.51) 


Here G is Newton’s constant of gravity and T,,,, is the energy-momentum 
tensor of matter. We have continued the definition of the curvature tensors to 
Lorentzian signature in this equation. The Einstein equation may be obtained 
as the variational equation for the action 


—_ 4 
S=- “reg | V5 R + Smatter 


= a i Cabo Rat A BO eaSmatiee (14.52) 
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In the second line, we have written the integral as the integral of a four-form 
involving the curvature two-form. The variation of the gravitational part of 
the action gives 


Si= ~ / vo d‘*z [Ryda + R= gee 9g (1a63) 
The variation of R,,, involves covariant derivatives and integrates to surface 
terms since Dagy, = 0. Using 6./—g = $959” and the result (3.107) for 
the variation of the matter part, we can check equation (14.51). 

The coupling of matter to gravity is achieved by the gauge principle of 
replacing ordinary derivatives by covariant derivatives. Thus for a scalar field 
y coupled to gravity the action is 


il 
So i /-9 d‘z E go" On pOvye — sme ee (14.54) 
For the Dirac theory, the corresponding action would be 
Sime: = ‘i V—9 diz p [iy?(E~*)™ (0, — iw?S*) —m] p (14.55) 


Here S® is the spin matrix for the Dirac spinors, $°° = —[y?, y°]/4%. For the 
gauge theory, the action is 


Sym=-- if V—9 Bax gig’? Fe, (14.56) 


F = dA+ AAA does not involve the metric, and so one need not use 
derivatives which are covariant with respect to coordinate transformations or 
local rotations. If one uses the explicit formula and covariant derivatives in 
the coordinate basis, the Christoffel symbols cancel out. 


14.2.2 Complex manifolds 


Given an even-dimensional differential manifold, one can ask the question 
whether it is possible to combine the coordinates into complex conjugate 
pairs. This question is motivated by the fact that in two dimensions, it is 
often very useful to combine the coordinates into complex coordinates z = 
x21 — iva and Z = 7; + 1x9. If T, denotes the tangent space at a point (with 
coordinates x) on a general 2n-dimensional manifold, an almost complex 
structure J is defined as a tensor field which gives a map T, — T;, for 
each z, such that J? = —1. In local coordinates, this means that we have 
a set of quantities J# such that J¥¢J& = —6d¥#. If we have such a structure, 
then we can define the holomorphic coordinate differentials (J? +7 6%)dz”. 
(Only n of these are independent.) Thus J is basically a rule to combine the 
coordinate differentials into complex components. (Even though an almost 


314 14 Elements of differential geometry 


complex manifold can be shown to be orientable, it is not automatic that 
such a structure can be defined on any even-dimensional manifold, S* is a 
counterexample. ) 

An almost complex structure gives a rule for combining differentials point- 
wise, but under certain conditions, the structure J can be integrated to define 
holomorphic coordinates in a neighborhood and eventually to define a com- 
plex structure for the manifold. The idea is that we can then define the notion 
of holomorphic functions and also achieve a separation of differential forms 
into different types. (The different types of differential forms may be labeled 
as (p,q), which indicates forms which have p differentials of the holomorphic 
type and q differentials of the antiholomorphic type.) They will remain of the 
same type under holomorphic change of coordinates. The integrability con- 
dition can be obtained as follows. Starting with the holomorphic differentials 
o4 = (Jl + i64)dx”, we can write 

do" = Ab o%o? + Bh ,o%a" + Ch ,a%a" (14.57) 
If there is a consistent separation into holomorphic and antiholomorphic 
forms, the derivative of (1, 0)-form can generate a 2,0)-form and a (1, 1)-form 
but not a (0,2)-form, since we have a holomorphic differential to begin with. 
This means that the derivative in (14.57) cannot have a term proportional 
to °G*; i.e., we must require CoB to vanish as the necessary condition for 
defining a complex structure using J. This can be reduced to the vanishing 
of the Nijenhuis tensor 


Ne, = JG, Je = 6,07) =O, Je on (14.58) 


The Newlander-Nirenberg theorem says that a manifold with an almost com- 
plex structure J satifying N/,, = 0 can be given a complex structure, if J 
obeys certain smoothness conditions such as being C™. Again, integrability 
is far from automatic; there are many examples of manifolds with almost- 
complex structures which are not complex manifolds. 5° is a famous ex- 
ample of a manifold with a nonintegrable almost-complex structure. In fact 
any almost-complex manifold of dim > 4 can always have a nonintegrable 
almost-complex structure, for even if one is given an integrable one, it can 
be perturbed to obtain a nonintegrable one. 

Of special interest to us are Kahler manifolds, which are defined as follows. 
Let M be a 2n-dimensional manifold with a complex structure J (so that 
J satisfies Nj), = 0) and a Riemannian metric g,,,. The metric is said to be 
hermitian if 


Gao Je Jp = Jap (14.59) 
Given a hermitian metric, the fundamental two-form 2 is defined as 
iL 
Q= 5 Ji Gow dx" A dx” (14.60) 


14.3 Cohomology groups 315 


(The definitions (14.58) and (14.60) are in terms of local coordinates, but 
can be easily written in more invariant ways.) Now given g,,, J and 2, M 
is said to be Kahler if d§2 = 0. In this case, 2 is the Kahler form, Guy isethe 
Kahler metric. For a Kahler manifold, one can choose complex coordinates 
222° suebithat 


ds? = Gapdz°dz” 
2 = 5 95dz" A dz (14.61) 
Further, the metric can be obtained from a potential K as 
t= = K (14.62) 


K is known as the Kahler potential. Examples of Kahler manifolds are Rie- 
mann surfaces and complex projective spaces CP”. There are many manifolds 
which are complex but not Kahler, S* x S$! is a simple example. 


14.3 Cohomology groups 


We have seen that there can be closed forms which are not exact. Since 
Poincaré lemma shows that locally every closed form is exact, the existence of 
closed but not exact forms is related to nontrivial global topological properties 
of the manifold. The cohomology classes capture this feature of a manifold. 

We define an equivalence relation that two forms are equivalent if they 
differ by an exact form; i.e., w; ~ we if there is a form a@ such that wy = w2+ 
da. The equivalence classes defined by this relation are called the cohomology 
classes. More specifically, let C*(M,R) denote the set of closed k-forms on 
a manifold M, and let Z*(M,R) be the set of exact k-forms on M, where 
linear combinations of forms are taken with real numbers as coefficients; this 
is indicated by R as one of the arguments for these sets. We then define the 
k-th cohomology group of M over the real numbers as 


H*(M,R) = C*(M,R)/Z*(M,R) (14.63) 


(Cohomology classes can be defined with complex coefficients, Z2-valued co- 
efficients, etc., but we do not consider them here.) 

The cohomology classes by themselves do not give a complete charac- 
terization of the topology of the manifold; there are many other quantities 
needed to do that. 

The two cohomology groups H°(M, R) and H"(M, R) for an n-dimensional 
manifold M are of some special significance. Let M be compact and ori- 
entable without boundary and let V be the volume form. Since ¢ mV #0,V 
cannot be exact. V is evidently closed for dimensional reasons. Thus V is an 
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element of H"(M,R). In fact, up to multiplication by a constant, it is the 
only element of H"(M, R), for if w is another element of H"(M, R), we can 
write w = c V; then c has to be constant due to dw = 0, dV = 0. Therefore 
we may write H"(M,R) = R, with V as its representative element. 

On a compact manifold with no boundary, we can also define an inner 
product between elements a € H"(M,R) and § € Hr-E(M, R) by 


(a, 8) = a aNB © (14.64) 


We see that the changes a — a+ dy, B — 6+ dé leave this invariant by 
partial integration, so that this is really an inner product on the cohomologies. 
(The restriction to compact M with no boundary is to ensure that the inner 
product exists and the partial integration can be done. If we consider forms 
with sufficiently fast fall-off behavior at infinity or at any boundary M may 
have, these conditions can be relaxed.) Poincaré duality on the cohomology 
groups is the statement that H*(M,R) and H"—*(M,R) are duals to each 
other as vector spaces under this inner product. For a compact manifold with 
only one connected component, we define H°(M,R) = R so that Poincaré 
duality holds for k = 0 as well. 

The dimension of H*(M, R) is known as the k-th Betti number b; of M. 
The Euler number of a manifold is given by x(M) = >>, (—1)*dimH*(M, R) = 
>, (—1)*by.-. For a two-dimensional sphere for which H!(M, R) = 0, x(S?) = 
2; for Riemann surface with genus g, x = 2 — 2g. (It is easy to see that 
H'(M,R) = 0; if it were not so, there would be a one-form a, for which 
da = 0. The integral of a over any closed loop on S? is then an invariant 
under deformations of the loop. Since every loop on S? can be contracted to 
a point, the value of the integral is zero. This implies that a must be exact.) 


Cohomology of Lie groups 


The cohomologies of Lie groups are important for many questions in 
physics. These were worked out many years ago by Borel, Hirzebruch, and 
others. We will not give the general analysis here but will go through a simple 
construction of the corresponding differential forms. 

Consider a Lie group G, a typical element of which may be denoted by g. 
There are many parametrizations possible for g. For example, we may think 
of it as given in the form g = exp(it°@*), where 6° are the continuous group 
parameters and t° are matrices which give infinitesimal generators of g in 
some matrix representation. For most of this discussion, t® may be taken to 
be in the fundamental representation. Thus, for the group SU(n), t® may 
be considered as a basis of hermitian, traceless (n x n)-matrices, the index 
taking values 1,2,---,(n? — 1). Given g, we can construct the Lie-algebra- 
valued one-form 

w= g"dg = —it?E%d6' = ~—it* E* (14.65) 
where the one-form frame fields E* = E2(0)d6* were introduced in our dis- 
cussion of spontaneous symmetry breaking in Chapter 12. Starting with w, 
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by taking exterior products and traces, we can construct Tr(w*). When k is 
even, these vanish by cyclicity of the trace, e.g., 


Tr(w*) = —Tr(¢*t’)E° A EB? 
= —Tr(t’t*)E* A BE? 


=n (ghee ne 
= —Tr(w) (14.66) 
For odd values of k, 2°") = Trw* are nonzero in general. Since dw = 
d(g—'dg) = dg—} dg = ~—g~'dg g~1dg = —w?, we find 
dQ) — —Ty (w2wk? —ww? w-?.. ‘) 
ST (14.67) 


since Trw®’°? = 0. Thus 2‘) are closed differential forms. Further, these 
are not exact. The simplest way to show this is by a case by case analysis; 
we will consider the SU(n) groups. For k = 1, Q°) = Trw. This is zero by 
tracelessness of t* unless we consider U(1) for which g = exp(i@). For U(1), 
Q) = id@. Integrating this over the circle 0 < 6 < 2r, 


-i § QW) = (14.68) 


If Q@) were exact, then we would have 2) = da for some a which is 
a periodic function of @ (so that it is a proper function on U(1)), and so 
¢ Q°) = 0 by Stokes’ theorem. The result (14.68) shows that 2) cannot 
be exact. Thus it is an element of H!(U(1),R). For k = 3, we have 2) = 
Tr(g—!dg)%. For the group SU(2) we take the parametrization g = a + ibjo; 
with a? + b? + 62 + b% = 1, a,b; real. This shows that the group SU(2) is a 
three-dimensional sphere S*. We integrate 2°) over this $%. For this purpose, 
we may write 


u —l Qu; 
= = 14. 

OS wT’ i ES i) 
where wu; are three unrestricted real variables, —co < u; < 00, u? = ujuj;. We 
then find 

$ 2 = —24 x? (14.70) 
s3 


showing that 2) is not exact. Thus it is an element of H?(SU(2),R). 
2) is proportional to the volume of SU(2) for dimensional reasons and 
so H3(SU(2),R) = R. This result holds for any compact Lie group G, 
H3(G,R) # 0 for any compact Lie group, 2®) being a representative el- 
ement. 

A similar argument can be made for 25) = Trw®, where we can expect 
that demonstrating the nontriviality of 2) involves integration over a five- 
dimensional sphere $°. Such an integration requires a parametrization of 
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group elements in a way similar to (14.69), but there is a simpler argument 
one can use. First of all, notice that since the group SU(2) is S°, for this 
group Trw® must be zero. Generally 2) is of the form 


Q) = —4 Tr(t7t?teett®) E27 A--- A ES 
= == maqeag t“|[t4, t°]) B2A--- NBS - 


= Sapo (7 os ey E% A-+-A E® 

= pdt fOr FONE Aon AE (14.71) 
(For SU(2), d*™” = 0; this is another way to see that 2) should be zero for 
this group.) 2) can be nonzero only for SU(n), n > 3. Consider SU(3) first, 
which is eight-dimensional. In this case, 2°) ( 2) should be proportional 


to the volume element of SU(3). In fact, we have 


2°) -, QE) = Tr(t2ebeee4t°) Tr(tft9t") Et A--- A EP 


45 
= j——_V. 14.72 
a V3 SU(3) ( ) 
where we have used 
45 
en Te(t7toecett? Tr (tF49t") = i—— 14.73 
€ab---h ( ) ( ) “V3 ( ) 


and Vsy 3) is the volume element for SU(3) given by E? A---A E® = 
ee BV (3). We see that the integral of 2) A 2) over SU(3) is nonzero, 
since the volume integrates to a nonzero value. Since we have already shown 
that 2°) is cohomologically nontrivial, this result shows that ©) is closed 
but not exact. In other words, H°(SU(3),R) is nonzero. Notice that this 
result for SU(3) also follows from Poincaré duality, since we have already 
shown that H°(SU(3),R) is nonzero. As in the case of H?(SU(3), R), the 
dimension of H®(SU(3),R) is 1, and the nontrivial element is generated by 
2) up to constant factors. This result can be shown to be true for all SU(n), 
n> 3, 10(SU(n),R)=R,n 23. 
From the definition of 2©), we also find 


2 (gh) = 2) (g) + 2® (h) +d Tr| —5dh h~wdw — 5w(dh h-1)3 


+3dh ho wdh ho7w 


(14.74) 


If h is an element of an SU(2) subgroup of G, 2©)(h) = 0, Q©)(gh) and 
)(g) differ only by an exact form. For G = SU (3), the coset space 


. 
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SU(3)/SU(2) is S°. So in integrating 2) over S®, there are two possi- 
bilities; one may consider a sphere in SU(3) or one may consider a disc in 
SU(3) with the boundary lying in an SU(2) subgroup, this disc being a 
sphere in SU(3)/SU(2). In both cases, the integral of 2) gives a number 
characteristic of the cohomology involved, although the two results are not 
the same. ' 

Similar arguments can be made for 2°), for k = 7,9,--+. One finds in 
general that H*(SU(n),R) has one generating element 2¢*) for odd values of 
k,k = 2r+1, r=1,2,...,(n—1). For SU(n), this gives (n— 1) nonzero coho- 
mology groups. The rank of SU(n) is also (n — 1) and it has (n — 1) invariant 
tensors or Casimir invariants. This is not an accident, there is a correspon- 
dence between invariant tensors and cohomology elements. Evaluating 2¢*) 
in an arbitrary representation of G we find 


i a aR ee Nun’ 5. (aa) 


where dy“? °"** is given by the trace of t¢t¢? ---¢°"+1 with all indices sym- 
metrized, the t®’s being in the representation R. d7°?"°"*? is an invariant 


tensor of the Lie algebra of G. Thus the 2°") are, in fact, given in terms of the 
invariant tensors and suitable products of the frame field one-forms. There is 
a correspondence, known as Weil homomorphism, between Chern classes (to 
be discussed later) and Casimir invariants; the present connection between 
invariant tensors and cohomology classes for Lie groups is clearly related to 
this. 


14.4 Homotopy 


Let fi(z): M — WN and fo(z) : M — N be two continuous mappings from 
a manifold M to another manifold NV’. Suppose we can find a continuous 
mapping f(z,7) depending on a real parameter 0 < 7 < 1 such that f(z,0) = 
fi(x) and f(z,1) = fo(x). We then say that f;(x) and fo(x) are homotopic 
to each other. Basically this means that f;(x) can be continuously deformed 
to fo(x), 7 being the parameter characterizing the deformation. 

Given a number of mappings from M to N, we can define an equivalence 
relation f; ~ fe, if f; and fz are homotopic to each other. Mappings from 
M to N can then be grouped into equivalence classes where the members 
of each class are homotopic to each other. These homotopy classes of maps 
from M to N are important because there are many situations where certain 
physical quantities are invariant under smooth deformations of the mappings 
(or functions) involved. The notion of homotopy is then important in giving 
a precise characterization of their properties. 

While homotopy classes can be defined for arbitrary manifolds M and N, 
maps from spheres to a manifold NV are particularly important and useful. 
The homotopy classes of maps from the k-sphere S* to a manifold N are 
denoted by I7;,(’). We will consider these in some detail. 
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We start with IJ;(N), which is also very often referred to as the funda- 
mental group of WV. It refers to the homotopy classes of maps S' + NV. Such 
a map describes a closed loop in NV’. Homotopically equivalent maps are thus 
loops in VV which are continuously deformable to each other. J7;(\/) thus 
characterizes the loop-connectivity of NV. 

The simplest example of a nontrivial fundamental group, which is also, at 
least partially, a paradigm for more complicated cases, is that of the 2-plane 
with one point, say the origin, removed, namely, VW = R? — {0}. Clearly 
there are paths which loop around the origin which are not contractible. 
We can characterize the “nontriviality” of paths on this space as follows. 
Consider first loops which start at the point x and return there without going 
around the origin. There is evidently a composition law for these paths. If 
Co is a closed loop and C@ is another loop with the same starting point, the 
composite path Co + C@ is defined as the path which traces over Co first and 
then continues over Cj. We can also define a path C; which loops around the 
origin once. The composition C{ = C; + Cp then denotes a path which starts 
from z, goes around the origin once along C; and then goes along Co. Paths 
are equivalent if they can be smoothly deformed into each other, so Co ~ Ch 
, Cy ~ Cj, but evidently Co % C. We can also define —C as the loop C 
traversed the opposite way. Under these conditions we see that loops on the 
space have the following structure. Co +Cp ~ Co, C1 +Co ~ Ci, C1 + Ci ~ 
C2. The loops thus behave like the group of integers Z with Co acting as zero. 
In other words, [7,(R? — {0}) = Z. It has the structure of a group, with the 
geometrical loop composition represented by addition in Z. More generally, 
II,(N) are groups with the composition of maps represented in a suitable 
way. For this reason, IJ,(V) are referred to as the homotopy groups. 

Continuing with R? — {0}, the relation between loop compositions and 
the group of integers can be made more precise by defining an invariant for 
loops as follows. For any loop C from z to x introduce 


(cy = f a;da' (14.76) 
2,C 


where a;dz* is a one-form. We want v to be invariant under small deforma- 
tions of the loop C. This requires 


Oj0; — 0; 0% a (14.77) 


Further for a path which winds once around the origin, we need ¢ adr’ = 1. 
This condition, along with (14.77), determines a; for the two-dimensional 
problem as 


1 eg) 
QQ == —— = 


(14.78) 


Qa x? 


In polar coordinates (r,y), a;dx' = dy. Evidently, v(Cy) = 0, (Cl) 
1, v(Cz) = 2, v(—C,) = —1, ete. v(C) is a topological invariant for a closed 
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curve C’, i.e., it is unchanged under small deformations of the curve and is 
an integer for any closed curve C. It gives the element of J7,(R* — {0}) = Z. 
This integer measures the number of times the path winds around the origin, 
taking account of the orientation of the path. v(C) is called the winding 
number of C. 

The statements given above show an interesting relation with cohomol- 
ogy groups. If we consider a = a;dz’ as a one-form on N = R? — {0}, the 
condition (14.77) says that da = 0. Thus a@ is a closed one-form. On the 
other hand, if it is exact, a = dh for some (single-valued) function on N. 
In this case, v(C’) would be identically zero. (a = dy does not say that it is 
exact, since is not single-valued.) Thus, we are using a closed but not exact 
one-form, or an element of the cohomology group H1(N, R) to represent the 
element of J7;(\’). The composition rule shows that it is enough to consider 
elements of H'(\’,R) with the coefficients for taking linear combinations of 
forms restricted to being integers, i.e., H*(N, Z). The generating elements of 
this group are the same as H!(N,R). Because it is a restriction to integer 
coefficients of H'(N, R), the cohomology group H!(N, Z) is the group of in- 
tegers Z and a is its representative element. We are integrating this one-form 
over the loop to obtain v(C); more precisely, we use the map 2(r) : S1 += N 
to pull back the differential form on WV to a form on S! (which has coordinate 
T) and then integrate over $1. In many situations where the homotopy group 
is Z we are able to represent elements of the homotopy groups JJ,(MV) in 
terms of integrals over S* of a nontrivial element of H*(N,R) pulled back 
via the map f : S* > N. 

This relation with cohomology groups is, however, not generic. Homotopy 
and cohomology are different concepts and do not agree in general. For ex- 
ample, one has IJ3(S$*) = Z but clearly H3(S*,R) = 0 since we cannot have 
a differential three-form on a two-dimensional manifold. A few more general 
remarks about homotopy groups are the following. JJ; can be a nonabelian 
group, while H!(N,R) is Abelian; I7;,(\) for k > 1 are Abelian. In many 
cases where the homotopy group is the group of integers, it is possible to use 
elements of the cohomology group to define winding numbers which char- 
acterize the elements of the homotopy group. The homotopy group IIp(N) 
is defined as the set of connected components of NV. If there is only one 
connected component of N, IIp(N) has only one element. 


Homotopy groups of spheres 


A class of examples where the homotopy groups can be obtained in terms 
of the cohomology groups is the case of spheres for which IJ,(S”) = Z and 
II,(S") = 0, k < n—1. We use cohomology elements to write down a winding 
number which represents J7,(S”). In this case, we are considering the maps 
n*(xz): St, > SR, where n*, a = 1,2,---,(n +1), with n°n* = 1, represent 
the coordinates of the target sphere S’,. x’s represent the coordinates of the 
base sphere $%,. The volume form on the target space Sy is given by 
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dV = ee n™ dn? A dn A-+»Adnon*t (14.79) 
n! 
Pulling this back by the map n“(z), which means that we simply substitute 
n*(x) for the n®’s in the above formula, we get a differential n-form on the 
starting n-sphere S%,. By integrating this over Sj, we can define a winding 
number for the map n°(x) by ar 


! ai 2 az Gn+4+1 
a a BAT. n+l (14, 
Q|[n] fl vol(S*) ign Grpagh aa, 1 dn"* Nes = hee (14.80) 
where vol(S”) stands for the standard volume of an n-sphere, given in terms 
of the Eulerian gamma function by 


n+1 


= 21 2 
nS eee 
It is easily verified that Q[n] is indeed the same for homotopically equiv- 
alent maps. Consider an infinitesimal deformation of the map n°(z) given by 
n?(x) + 6n%(x). By direct computation we find 


(14.81) 


Q[n + dn] — Q[n] = oe (n+1) 6n™ we, —n du’ 


Wai = Caraga, die? Ne (14.82) 


ww! = Egan ae, (On mt dn N--- Aaa ae 


Since n°n° = 1 we have n°dn* = 0. The dn’s being orthogonal to n°, the n- 
fold antisymmetric product €g,a9-..a,4, ( dn®? A--- A dn°"+?) is proportional 
to n™. For the first term in the variation of Q[n], we then have 6n® n° which 
is zero since the variations 6n°(x) must also obey the requirement n*6n* = 0 
to preserve the condition n°n* = 1. The second term is a total derivative and 
integrates to zero on the closed space S{,. Thus Q[n + dn] = Q{[n], showing 
that Q[n] is invariant under continuous deformations of the map n°(z). It 
is thus a homotopic invariant. It is also independent of metrical or other 
geometrical structures on the two spaces $7, and SX, involved, since we have 
only used differential forms. 

The particular map n' = 1, all other n’s being zero, maps the sphere 
Si to a point on the target space Sf,. Evidently, Q is zero for this case. 
The map n* = z°* where x® are the coordinates of the sphere S%,, with 
xz°x? = 1, identifies the target sphere and the base sphere. For this map, 
Q = (1/vol(S")) f dV = 1. Thus any smooth deformation of the map n* = x* 
belongs to the equivalence class with winding number 1. 

Another general result of interest is IT;,($1) = 0 for all k > 2. There are 
also many intriguing results for the homotopy groups of spheres of the form 
ITn4%(S"), for example, [3(S?) = Z, II4(S*) = Ze, 11,,(S*) = Z, to name a 
few. There is no simple way to obtain these in terms of cohomology groups, 
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so we will not discuss them further. The exact sequence of homotopy groups, 
which we give later in this chapter, can be used to relate some of them to 
winding numbers as constructed above. 


Homotopy groups for compact connected Lie groups 


The homotopy groups for compact connected Lie groups G (where G = 
SU(n), SO(n), Sp(n), Go, Fy, Ee, Ez, or Eg) have all been calculated and 
are listed in standard mathematical tables. Some of the general results are 


Zo SO(2) 
P(E) we a Ae EOC Ne eo 
0, all other G 


II2(G) = 0 for all G 
(14.83) 
_ §ZxZ,G=S0(4) 
iia Ee all other G 


THor4i1(SU(n)) = Zn-—1l>r 


The result that the fundamental group of SO(3) is Za is the statement that 
SO(3) is doubly connected. This is, of course, well known and is related to 
the double-valued spinor representations of the rotation group. 

There are further general results beyond what is given in (14.83), but 
here we will just consider some cases for which we can write winding num- 
bers in terms of the cohomology elements. We have seen in our discussion 
of the cohomology groups of Lie groups that the SU(n) groups have one 
nontrivial generating element for the cohomologies H?"t+!(SU(n),R) which 
is of the form Tr(g~'dg)*"t1, n — 1 > r. We can use this to write a wind- 
ing number for the corresponding homotopy groups which is of the form 
r= ey)... Rioakig)"' fomamapg@)< ST =eSUin)paeed per, 
A being a normalization constant. This is a realization of the last result in 
(14.83). Specifically for the case of [73 we find 


Qlo) = —spe5 |, Teor" (14.84) 


Intuitively, we can understand this result by considering SU(2), which is S°, 
so that this result reduces to the previous case, J73($°) = Z. For higher 
groups, there is always an SU(2) subgroup into which the 3-sphere S° can 
be mapped. From the definition (14.84), we can easily check that 


Q[gh] = Q[g] + Q[h] (14.85) 


The extra crossterms which arise combine to total derivatives and integrate 
to zero. This result shows that if h is a small deformation, of zero winding 
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number, Q[gh] = Q[g] which is the homotopy invariance of Q. Secondly, if gi 
has winding number 1, a mapping with winding number tp is given by 9191, 
and an element with winding number —1 is given by gl. It is then easily 
seen that we generate Z from using @ to represent the homotopy group 


14.5 Gauge fields 


In this section, we will see that the formalism of differential forms give a very 
natural setting for discussing gauge theories. We start with the electromag- 
netic field. 


14.5.1 Electrodynamics 


The electromagnetic field is described by the vector potential A, which enters 
the theory via the covariant derivative D, = 0,,—ieA,. (Here we will display 
the coupling constant e explicitly in the covariant derivative, since this is 
more conventional for the Maxwell theory.) A, is thus naturally a covariant 
vector, and so we define the one-form 


A= A,dz" | (14.86) 


Applying the exterior derivative on this we find 
1 
dA = 0, A,da" \ da’ = 5 (O,A, — OLA,) dz” A dx” 
1 
= pF uvdah A da” 
=F _ (14.87) 
The field strength tensor is thus the exterior derivative of the one-form po- 


tential A. The identification of the usual electric and magnetic components 
is given by 


F = Fo; dx® Adz’ + 5Fu dx’ A dx) 
al : 
= E, dx° A da* + 3 ctik Br dz’ \ dx (14.88) 


Because d? = 0, we have immediately dF = 0. This is written out as 
1 
dF = 3! (OuFva + OoFuy + Oy Foy) dx" \ da” \ dx = 0 (14.89) 


This is the so-called Bianchi identity and is identical to the sourceless Maxwell 
equations. 
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The Hodge dual of F is also a two-form in four dimensions. The con- 
travariant tensor obtained from F of (14.88) is 


0 Oo i} om) 
Ox* Ox° ii 3 Cisk Be Ox) Oxi 


Taking the contraction of this with the volume form dx® A dz! A dz? A dz’, 
we find 


Tr = (14.90) 


Peas. 
*F = B; dx® A dz' — 9 isk Ek dx’ A dx (14.91) 


(In equations (14.88) to (14.91) we do not distinguish between upper and 
lower indices for the spatial components , all minus signs due to the Minkowski 
metric are explicitly written out and raising and lowering of spatial indices 
are done by using the Kronecker delta.) 

The current for the Maxwell theory, being a vector, can also be thought 
of as a one-form. Its dual is a three-form given by 


*J= PPr— studs dx® A dx‘ A dxi (14.92) 


It is now easily checked that the equations d*F =* J are identical to the 
Maxwell equations with the sources. In summary, the Maxwell equations are 
then 


F=dA 
d*F=*J (14.93) 


Again from d? = 0, the second equation leads to d*.J = 0. This is the conser- 
vation of the current. 
The action for the Maxwell theory can be written as 


s--5/ FA‘ F + /) AN (14.94) 
2Jm M 


For the classical electromagnetic theory, instead of starting with the po- 
tential, we could take the two-form F as the basic variable and write the 
equations as 


dF =0 
d*F=*J (14.95) 


Since F is closed, the Poincaré lemma tells us that we can write F = dA at 
least locally. This takes us back to the potential. The existence of the potential 
is only guaranteed locally, and one could have situations where we do not have 
a globally defined potential. From our definition of the cohomology group, we 
see that this can happen if the manifold has nontrivial second cohomology, 
namely, H? 4 0. An example of this is the magnetic monopole, which we 
shall now discuss briefly. 
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14.5.2 The Dirac monopole: A first look 


Consider a point magnetic monopole with magnetic charge g. Taking the 
origin of coordinates as the location of the magnetic monopole, the magnetic 
field is 


Ty 
|x|? 


This is singular at the origin |z| = 0, so, in order to have a nonsingular field, 
we must consider this as being defined on R? — {0}, i.e., R® with the origin 
removed. For M = R3 — {0}, there are noncontractible spheres we can draw 
around the origin. Thus the space is topologically nontrivial. The two-form 
F corresponding to (14.96) is 


Bea (14.96) 


By direct computation, d F = 0. (Actually, we find a delta function 5©)(z), 
but this is zero on M since it does not include the origin.) Integrating F on 
a sphere around the origin 


i ae r? sin Od0dy 
Pa] cau ee Ad? = 9 | 
I. ie aa ’ lan|? e ss of. re 


=Ang (14.98) 


Thus F is closed but cannot be exact on M. This also shows that H?(R? — 
{O}) #0. 

The sphere surrounding the origin can be described by two coordinate 
patches, one covering the northern hemisphere (x* > 0 — ¢) and the other 
covering the southern hemisphere (x? < 0+ €), with a small overlap region of 
width 2€ around the equator. On each of these patches, by Poincaré lemma, 
we should be able to write F as dA for some A. By direct calculation we find 


E€abclla Lh 
An = — c 
r(ir+n-2) g 
EabcltaLh 
A SS SS er c : 
s Gana (14.99) 


where nj = ng = 0, n3 = 1 and r? = z--z. Ay can be used for the northern 
hemisphere, it has a singularity at the south pole where n- x = —r. Likewise 
Ag can be used for the southern hemisphere and has a singularity at the 
north pole. The singularities of these potentials is a reflection of the fact that 
F is closed but not exact and so cannot give a nonsingular potential. The 
singularity is at a point (either the south pole or the north pole) and so in 
the full space M we have a line of singularities, stretching to infinity, for An 
and Ag considered as separate functions. This line of singularity is called a 


Dirac string. By using the two potentials, each in its own patch, we can avoid 
the singularity over all of M. 
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The two potentials overlap at the equator where n-z = 0. In this region 
we find 
2 d2x2 = x2dx1 


(xj + 23) 
where ¢ is the azimuthal angle. The difference in the overlap region is a gauge 


transformation with gauge parameter A = 2g y. Thus the physical field B is 
the same in the overlap region, whether computed using Ay or As. 


An = Ag = 29 = 2g dy (14.100) 


14.5.3 Nonabelian gauge fields 


We now turn to nonabelian gauge fields. In this case, the potential is also an 
element of the Lie algebra, and so we introduce a Lie-algebra-valued one-form 
A written in local coordinates as 


A = (—it") A“ dz# 14.101 
be 


where ¢® are hermitian matrices which form a basis of the Lie algebra of the 
group. As we have written it, A is antihermitian; this is convenient for many 
calculations. 

The covariant exterior derivative of a function ¢ is given by Dé = (d+ 
A)@¢. Therefore, D?¢ = (d+ A)(d + A)¢ = (dA + A?)¢ using d? = 0 and 
d(Ag) = dA¢ — Ad®@. (We will not write the wedge sign anymore whenever 
it is clear from the context.) This calculation shows that we must define the 
field strength as 

F=dA+ A? (14.102) 


Using the expression for the potential, we may simplify this as 
_ 0 *90\ AG “2b “4c\ Ab Ac Ls y 
Ye E (—it*) A? + (—it”)(—1t°) A, Al | da" A da 


a ; [(—ét*) (0,49 — 0,49) + [(—it”), (—it°)] AL Al] da A da” 


= A (it) [9,48 — 0,42 + fO°°AR.AS] dat A da” 
(14.103) 


In the second step we used the antisymmetry of the wedge product. We see 
that the field strength is indeed what we expect from our previous discussions 
about gauge fields. 

The gauge transformation of the potential by the group element g may 


be written as 
A-— A9 = gAg™! —dgg™ (14.104) 


We can directly check that F9 = gFg~' using (14.102). 
The Bianchi identity is now more involved. We find 
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dF =dA A—AdA 
=(F-A*)A-A(F- oe 
=FA-AF (14.105) 


In components, this works out to be the usual Bianchi identity 
Dy Fiat Dak p DiF p=0ee (14.106) 


The Yang-Mills action can be written as 
s--3 | F* qt F 
- Qe? M 
= = / Tr(F A* F) (14.107) 
e JM 


where we have chosen the normalization of the matrices t® as Tr(t*t?) = 56°. 
As an example of a calculation with differential forms for the nonabelian 
gauge theory, we consider the quantity 


v{ A} 


a iP Tr(F F) 
‘ Q (14.108) 
M 


Hil 


This is known as the instanton number and its relevance to physics will be 
discussed later. For now, we notice that 
at 
dQ=——~Tr(dF F 
372 ( +F dF) 


= — 25 1h((FA~ AF) F +F (FA- AF) 


=p (14.109) 


where we have used the cyclicity of trace. (For a matrix-valued k-form a and 
a p-form 3, Tr(a 3) = (—1)?*Tr(G a) as can be checked easily using the local 
coordinate expressions.) The four-form 2 is thus closed and so, by Poincaré 
lemma, we should be able to find a three-form K with 


Q=- att (F F)= (14.110) 


Since K is a three-form built out of d’s and A’s, it has to be of the form 
K =a Tr(AdA + b A?) (14.111) 


where a,b are constants. We then find 
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dK =a Tr(dAdA+6dA A? —b AdA A+b A’dA) 
= a Tr (dAdA + 3b dA A?) 
=a Tr ((F — A*)(F — A*) + 3b (F — A?) A?) 
=a Tr (F F + (3b— 2) FA?) (14.112) 


(We used TrA* = 0.) Choosing 3b = 2 and a = —1/877, we get the result 


a 1 2 3 

K= galt (4a + 3 A°) 
BoelaT aes 14.113 
G2 3 (14.113) 


K is known as the Chern-Simons three-form. 
Notice that, even though {2 is gauge-invariant, K is not. From the gauge 
transformation (14.104), we find 


K[A9] = -~ah («4 +v)F — 3(A + 0°) 


= K[A] - aot (va —v°A~ 5°] 


27a d | aT a)| + erg 7)? (14.114) 


where v = dg g~! and we have used the result dv = v?. The last term 
is at least cubic in the parameters of g and so, for infinitesimal transfor- 
mations, K changes by an exact form. The last term can give nontrivial 
contributions upon integration over three-spaces of nontrivial topology. This 
will be discussed in more detail in Chapter 16. Notice that, because K is 
not gauge-invariant, we cannot conclude that the integral of (2 over a closed 
four-manifold, namely, v, is zero. Just as in the case of the monopole, if v 
is nonzero, K will have singularities and we have to define a separate K 
for each coordinate patch, with K in different patches related by a gauge 
transformation on the overlap regions. 


14.6 Fiber bundles 


Fiber bundles are the proper mathematical notion for introducing internal 
symmetries in a field theory. Consider a set of fields ¢* which take values in 
some space F on the spacetime manifold M. Consider a neighborhood U of 
M. On this the information we need for the theory is given by (U, ¢'). We 
thus have a direct product structure U x F with ¢'(x) : U — F as a map 
from U to F. A fiber bundle is the formalization of this idea. 
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A fiber bundle over a manifold M is a manifold E which is, locally on M, 
a direct product of M and F, where F is also a manifold. Thus U; are open 
sets of M, € = U; x F for each U;. We denote a fiber bundle by the triplet 
(F,€,M), F is called the fiber, € is the bundle, and M is called the base 
manifold. Just as the coordinates of U; are related to the coordinates over U; 
by transition functions on the overlap region U; NU;, the fiber over U; will be 
related to the fiber over U; by a set of transition functions ®;; on the overlap 
region. This is a reflection of the freedom of overall coordinate changes of €. 
It may be worth emphasizing that the choice of the space F is the same over 
all of M; we cannot change F from one neighborhood to another. 

If F is a vector space, we say that € is a vector bundle. The transition 
functions are then general linear transformations of F (or a subset of them). 
If F is a Lie group G, the bundle is called a principal bundle. 

A local section s(U;) of the bundle is a rule which assigns a point of F to 
each point of the neighborhood U;; in other words, s(U;) : U; — F. This is 
precisely the notion of the field ¢*(z). 

A bundle is said to be trivial if E = M x F, so that the direct product 
structure holds globally on M. In this case, sections are globally defined. 
Effectively the transition functions ®;; for the fibers can be chosen to be 
identity. A general result is that a bundle over a contractible space is trivial. 


Examples 


1. The tensor bundles and bundles of forms 


We have earlier introduced the space of tangent vectors at each point of 
a manifold. Thus we do indeed have the local structure of a direct product 
of an n-dimensional vector space and U; for each neighborhood U; of an n- 
dimensional manifold M. This defines the tangent bundle TM, a term that 
we have already introduced. Sections of the tangent bundle are vector fields. 

In a similar way, we can introduce the cotangent bundle T*M, sections 
of which are covariant vector fields or one-forms. More generally, we can talk 
of tensor bundles of different ranks and also of bundles of differential forms. 


2. Scalar fields on M 


Consider complex-valued fields as an example. In this case, € is locally U; x 
C, sections are complex-valued fields ¢(x), which are in general only locally 
defined. If we consider two neighborhoods U and V, we need a transition 
function ®yy which relates gy and dy. Such a transition function could be 
a phase transformation, for example. This corresponds to the rule 


du (x) = ev) g(x) (14.115) 


on the overlap region UNV. If the bundle is trivial, € = M x C, ¢(z) are 
globally defined functions on M. 


3. The Mobius strip 
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This is one of the simplest examples of a nontrivial bundle. In this case, 
M = S' and F = [-1,1]. We describe S$! using two coordinate patches U 
and V, corresponding to -e << 6< m+e and r—€ <6 < 27+ ¢€, respectively, 
in terms of the usual angular coordinate. There are two overlap regions Ry = 
(7 —€,7+€) and R7; = (—«,€). € is given by the two neighborhoods U x ty 
and V x ty where t’s denote neighborhoods of F. The transition rules are 
taken as 


Ube SAV on I 
ty = —ty on II (14.116) 


There is a change of orientation for F from U to V on one transition region. 
A general section s, if we try to extend over all of M, will have a singularity 
at one point, because the value of s must have opposite signs approaching 
this point from the two sides. Thus this bundle is nontrivial. (It is also nonori- 
entable.) 


Connection on a fiber bundle 


Consider a vector bundle over M where the fiber is some N-dimensional 
vector space. Sections of this bundle are of the form ¢'(z) and can be used 
as an N-component scalar field. Strictly speaking, ¢' are the components of 
the ¢ along some chosen basis vectors, say, e; in F, with ¢ = ¢*e;. The basis 
{e;} defines a frame for F. In the case of tangent vectors, we have seen that 
vectors are properly written as X = €*(r)e;, where 


3) 


~ Oxi 


e; (14.117) 
in the coordinate basis. Likewise, one-forms are w;e*, e’ = dz’ in the coordi- 
nate basis. The components £*, w; transform nontrivially under coordinate 
changes, but the vectors and one-forms are invariant. In a similar way, the 
invariant way to write the sections of the vector bundle we are considering is 
d(x) = ¢°(x)e;(xz). The frames for F can in general be different at different 
points on M. The natural question that arises is then how the frames change 
over the manifold M. Let de; be the difference of frames at nearby points 
with coordinates x“ and x’ + 6x+. de; can be expanded in terms of {e;} 
themselves and it should also be proportional to dx". Therefore 


be; = e;A? da" (14.118) 


This defines a one-form A’ ,dz" which is called the connection one-form of 
the bundle. The derivative of a section ¢ is now given as 


do = de;¢’ + e,d¢' 
= e;(D,,¢)* dz* (14.119) 
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We see the emergence of the covariant derivative; the connection one-form 
is seen to be the gauge potential. In this way, fiber bundles naturally lead 
to the idea of gauge theories. (More generally, the frames could also depend 
on the coordinates of the fiber space F; then we would have Ai, da“, where 
z denote all the coordinates, on the base and on the fiber. This would be 
the general form of a connection on a bundle.) The field strength F' for the 
connection is referred to as the curvature of the bundle. © 

There is also arbitrariness in the choice of the frame. We could do an 
invertible linear transformation e; > e; = ex(g—')xi; this can be compensated 
for by a change in the components ¢' — ¢ = gd’. The transformation 
matrix g does not have to be constant over M. Mathematically it only arises 
from splitting ¢ into components ¢* and the basis vectors e;; the intrinsic 
quantity ¢ is insensitive to the transformation effected by g. Physically, it 
is sensible to expect that we can do a local basis transformation; it would 
be strange if we could not and if we had to align the frames in our local 
region with frames in a region light-years away before we could properly 
set up the physics. Physical results must be independent of the local basis 
transformation. This is the essence of gauge-invariance. The transformation 
o — ¢' = gd is a gauge transformation. 

The present discussion highlights the fact that gauge theories have a very 
deep geometrical origin. We have already discussed the physics of gauge the- 
ories in some detail. The concept of fiber bundles clarifies some of our dis- 
cussion and will also help in understanding the geometrical and topological 
properties of gauge fields. 

Fiber bundles provide the natural setting for all physical fields. Matter 
fields are sections of various vector bundles over the spacetime manifold, 
the fiber being complex numbers or spinors of the Lorentz group. Gauge 
fields are connections on these vector bundles. As we have already discussed 
earlier in this chapter, connections on the tangent bundle to spacetime lead 
to Christoffel symbols (or, equivalently, spin connections) and the theory of 
gravity. 

The exact homotopy sequence for fiber bundles 


There is an interesting technique for relating the homotopy groups for 
the fiber, bundle, and base of a fiber bundle which is very useful. This is the 
exact sequence of homotopy groups which gives a series of maps between the 
homotopy groups involved; it can be written as 


> Ty41(M) =i LT) = Le) — IT;,(M) a OP i) ease ee 
(14.120) 
This sequence is exact, which means that the kernel of a particular map, 
namely, those elements which are mapped to zero or the trivial element, is 
the image of the previous map. For any given k, the order of the spaces 
involved is fiber, bundle, base. The sequence is in principle infinite, but by 
focusing on subsequences, one can obtain useful results on homotopies. 
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As a simple application of this idea, recall from Chapter 12 on sponta- 
neous symmetry breaking that we can write the two-sphere as a coset space 
SU(2)/U(1) = S°/S*. This is equivalent to saying that SU(2) or S° is a 
bundle over S? with U(1) = S1 as the fiber. The sequence (14.120) around 
k = 3 becomes 


— IT3(S*) - ITI3(S*) > IT3(S?) + IIg(S1) > (14.121) 


From what we have said before, [2(S1) = 0. Thus all elements of I73($7) 
are mapped to zero. The exactness of the sequence tells us that all the ele- 
ments of J73(S*) must therefore be images of elements of J73(S°). Further, 
since [T3(S1) is zero, the only element of J73($°) which can come from a 
previous map is the trivial element. The other elements of I73(S*), which are 
not images of the previous map, cannot be mapped to zero in JI3(S7). The 
mapping J73(S°) — IT3(S*) must therefore be one-to-one. This shows that 
IT3(S*) = Z, since we already know that I73(S%) = Z. 

Another example of interest is the coset space G/H, where G is simply 
connected, so that J1;(G) = 0. Since I2(G) = 0 as well, we get the result 
IIz(G/H) = IT,(#). For example, choosing G = SU(n) and H = U(n— 
1), we get the complex projective space CP"~' = SU(n)/U(n — 1). Since 
IT,(U(n—1)) = Z, we find Iz(CP”~*) = Z. There are many examples of the 
application of the exact sequence, some of which are of interest in physics. 
We will take up some more cases as the occasion arises. 
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14.7.1 Scalar fields around a magnetic monopole 


Consider scalar charged fields in the background of a magnetic monopole. 
The radial dependence of the fields is not important for this discussion, so we 
will take M = S?. The fields are sections of a bundle with the fiber being C. 
We then have the field ¢y on the northern hemisphere and ¢g on the south- 
ern hemisphere. The intersection of these two hemispheres is the equator. 
The restriction of the fields dy and ¢g to the equator need not be identical, 
but can differ by a gauge transformation. The gauge parameter of the trans- 
formation was already worked out in the last section as A(y) = 2gy, where y 
is the azimuthal angle. Since the gauge potentials undergo transformation by 
A, the charged fields must be related by ¢n(y) = e*“) ¢g(y). This means 
that the transition function e*°“() is an element of U(1), the gauge group. 
If we take the relation ¢y(y) = e*4() ds(y) from y = 0 to y = 2z, thus 
coming back to the same point on the equator, single-valuedness of the wave 
function will require 


exp [ie (A(27) — A(0) )] =1 (14.122) 
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Using the formula for A given after (14.100), we find that this is equivalent 
to 4m7eg = 27n, where n is an integer or 


(14.123) 


mls 


eg= 


The product of magnetic and electric charges must be quantized. This is the 
famous Dirac quantization condition. It follows from the singlevaluedness of 
the wave function, but its origin is purely topological and not related to the 
dynamics. 

The strength of the monopole g is an integer multiple of 1/2e. We can 
therefore consider a monopole of charge 1/2e as the fundamental monopole 
and n as the number of such monopoles which go into making the configura- 
tion (14.96). 

If n # 0, the bundle is nontrivial since we cannot take the transition 
function to be 1. For example, try to define @y over all of the northern 
hemisphere, not just the equator, via the relation @v(y) = e°“) dg(y). 
As we take this to the north pole, we see that the function e*°““%) becomes 
singular when n # 0. Thus we will not be able to make the transition function 
equal to 1. This nontriviality of the bundle is measured by 


eF 


(— 
S2 20 


(14.124) 


14.7.2 Gribov ambiguity 


In our discussion of gauge fixing and the Fadeev-Popov procedure for func- 
tional quantization of gauge theories, we have seen that there could be the 
problem of Gribov ambiguity. This refers to the fact that there could be dif- 
ferent field configurations which obey the same gauge-fixing condition, but 
which are related by a gauge transformation. This means that we are unable 
to find a “good gauge fixing” where the gauge-fixing condition chooses one 
and only one representative configuration for all potentials which are gauge 
equivalent. This Gribov ambiguity can be given a precise mathematical char- 
acterization in the language of fiber bundles. 

We start by recalling some concepts defined in Chapter 10 on gauge the- 
ories. Let A denote the set of gauge potentials, i.e., the set of all Lie-algebra- 
valued one-forms on R* which may be taken to obey the condition of finite- 
ness for the Yang-Mills action. A point on this space is a particular gauge 
potential on R*. Further let G, denote the set of gauge transformations on 
Euclidean four-dimensional space, 


or {set of all g(x) such that g(x) > 1 as /za, — oo} 


Because of the boundary condition, these transformations are topologically 
equivalent to transformations on S*. The gauge-invariant set of configurations 
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is given by C = A/G,. This is the space we are interested in. A neighborhood 
of C is a set of field configurations which are physically distinct. A set of 
configurations in A may be thought of as being given by a set of representative 
configurations which are points on C and the set of gauge transformations 
one can perform on them. Thus we have a natural splitting 


A=Cx Gx (14.125) 


for any neighborhood in C. This leads to the structure that A is a fiber bundle 
over C with G, as the fiber. This bundle need not be trivial, so the splitting 
(14.125) is not globally true. 

We can think of gauge fixing as follows. When we do gauge fixing, we 
choose a representative potential A (obeying some gauge fixing condition) for 
each physical configuration. Thus we are specifying the physical configuration 
C and a gauge transformation gc associated to it which takes it into A. We 
have an assignment of a point on the fiber, namely, gc, for each point C' in 
C. In other words, gauge fixing is the choice of a section for the bundle A. 
If we can choose a section globally, then we have the splitting A = C x G, 
globally. There is no problem with gauge fixing and no Gribov ambiguity. The 
existence of the Gribov problem is thus equivalent to the statement that the 
bundle (G.,.A,C) is nontrivial and does not have a global section. This gives 
a precise characterization of the Gribov problem as a topological property of 
the bundle of gauge potentials. 

Our argument so far does not prove that there is a Gribov problem; to 
do that, we must prove that the bundle (G,,.A,C) is nontrivial. For this we 
need some information about the topology of the spaces involved. As we 
have argued in Chapter 10, the space A has the property that any two points 
Ai,,(z) and Ag,(#) can be connected by a straight line. This line in A is 
given by 

Aur, x) = 7 Agg(z) + (1—T) Aay(z) (14.126) 


for 0 < 7 < 1. Notice that the intermediate configurations A,,(7, x) transform, 
for all 7, as a gauge potential is expected to transform; viz., we have the 
transformation rule A% (7,2) = 7 Aj,,(z)+(1—T) Aj,,(a). A is thus an affine 


space; i.e., we can write any configuration A,,(z) = A) + &, where €,,(x) is 
a Lie-algebra-valued vector field. This shows that A is a contractible space. 
In other words, its homotopy groups are trivial. 

If A is of the form C x G, globally, then the factor spaces C and G, must 
also be contractible. The strategy of proving the Gribov ambiguity is then 
to show that the factor spaces are not contractible, so that by reductio ad 
absurdum, we can conclude that the bundle (G,,.A,C) is nontrivial. 

Consider a one-parameter sequence of gauge transformations given by 
g(s, x), where s is a real variable, with g(s,r) — 1 as s — oo. This is a closed 
curve in G, starting at a point corresponding to the identity transformation 
(at s = —oo) and ending back there. Thus homotopy classes of such closed 
curves in G, will give IT,(G,). We can also think of the map g(s,z) as a map 
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from R® to the group G with the boundary condition that g goes to 1 at 
infinity in any direction. Such maps are equivalent to maps from S° to G. 
The homotopy classes of such maps are then given by I/5(G’). We therefore 
have the result that I7,(G.) = JIs(G). Since II5(SU(n)) = Z for all n > 3, 
this immediately shows that G, is not contractible for SU(n), n > 3, proving 
the existence of the Gribov problem for such gauge groups. If the gauge group 
is SU(2), there is the result that J74(SU(2)) = Zo; this leads, via a similar 
argument, to II9(G.) = Ze, proving the Gribov problem for this group as 
well. 

The exact sequence of homotopy groups can be used to make some more 
interesting statements about the bundle (G,,A,C). If we choose k = 2, the 
sequence gives 


— [I2(A) > I2(C) > I1(G.) > Ih(A) > (14.127) 


Since A is homotopically trivial, this shows that I72(C) = Z for gauge groups 
SU(n), n > 3. For SU(2) one can similarly obtain [1,(C) = Zo. 

If we do a similar analysis in three dimensions, then, by arguments similar 
to those given above, we can relate IIo(G,.) to 113(G). Since the latter is Z for 
compact Lie groups G, (except for SO(4) for which one has Z x Z), we get 
the result that J7;(C) = Z (Z x Z for SO(4)), where C is the gauge-invariant 
configuration space of three-dimensional gauge potentials. This space C also 
corresponds to the configuration space at fixed time in the Ag = 0 gauge of 
the four-dimensional theory, which is appropriate for a Hamiltonian analysis 
of the four-dimensional theory. The result J7,(C) = Z is then related to the 
existence of instantons. 


14.8 Characteristic classes 


The Dirac magnetic monopole is an example of a nontrivial fiber bundle. 
The nontriviality of the bundle resides in the fact that F' is closed but not 
exact. We have also seen from the discussion of scalar fields on a monopole 
background that the nontriviality of the bundle is measured by f F'/2. Char- 
acteristic classes generalize this idea. We measure or characterize the nontriv- 
iality of a bundle in terms of a set of differential forms constructed from the 
curvature F (or the Riemann curvature R in the case of gravity). There are 
a number of characteristic classes which are relevant for different questions 
of topological nature. We shall list some of them here. 


Chern class 
This is relevant for a complex vector bundle on a manifold M. Let F be 
the curvature of the bundle. The Chern classes are then defined by 


c(F’) = det (1 ae i=) 
=1+4+ o(F) + e(F) +--- : (14.128) 
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c; is callled the first Chern class; cz is the second Chern class and so on. 
Using 


det(1 + 7X) = exp[Trlog(1 + iX)| 
= 1+ TriX)+ ; [TreX TriX — Tr(iX)?] +++, 
(14.129) 


we can work out the c;’s. F is a Lie algebra valued two-form and in expanding 
the formula (14.128), the wedge product and matrix product are understood. 
We thus get 


1 
c1(F) = 52 Jiggy 


eo(F) = => [Ir(F AF) — (TrF) A (TrP)] (14.130) 


aq | 


For an SU(N) gauge field, TrF = 0, so that 
1 
co(F) = aaa Tr(F A F) (14.131) 


det(1+72X), for arbitrary X in the Lie algebra of a group is the generating 
function for the Casimir invariants; thus, the Chern classes are all seen to be 
proportional to the Casimir invariants of the Lie algebra. This is the Weil 
homomorphism. 


Chern character 
This class also pertains to a complex vector bundle. The Chern character 
Ch(F) is defined by 


Ch(F) = Tr exp (i=) 
=1. + Chigpmmcniee: (14.132) 


Ch,(F) coincides with c;(F’), while 


Chay eee 50 (14.133) 


A — genus 
Here we consider a background gravitational field with a Riemann cur- 
vature two-form R which also take values in the Lie algebra of SO(2n) for 
a 2n-dimensional manifold. The generating function for the A-genus is then 
given by 
A(R) = J] ee (14.134) 
sinh vi y 2 


t 
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where the x;’s are defined by 


1 0 0 
R art Oo § 0 0 0 ce i 
a 0. s0ieetae (14.135) 
T 


a) 
—) 
| 
8 

bt 
oO 


The idea of this representation is that invariant polynomials of R can be 
expressed in terms of sums of products of x;’s. The interpretation of (14.134) 
is that we expand it and re-express the sums of products of z;’s involved in 
terms of traces of products of R/27 using (14.135). Expanding (14.134) to 
the lowest nontrivial order 


“ ABA 1 2 


4 


= (1-74 re] 


i 
-1 Sy poe (14.136) 


Euler class 

The Euler class is zero for an odd-dimensional manifold. For an even- 
dimensional manifold it is defined as follows. Consider a real (2n x 2n-matrix 
M. Its determinant is a perfect square, so that we can write det M = (e)?. 
This defines the Pfaffian e(M). The Euler class is now given by substituting 
R/2n for M with wedge products of the R’s taken. For a 2 x 2 matrix, we 
can bring it to the form 


Ones 
M= ce 4) eaesh) 
to evaluate det M = $(€a,M®)?, giving 
i 
e(M) = Tear” (14.138) 
In four dimensions, we start with 
0 Ti 0 0 
= oo! bi 0 0 0 
M= 0 0 i (14.139) 
0 0 -z2 O 
with det M = (€abcaM?®M“)? /64. This gives 
1 
e(M) = ——searceaR? RM (14.140) 


3272 
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The Euler number of a closed compact manifold was defined earlier as 


x(M) = $0 (-1)*b, (14.141) 
k 


where by are the Betti numbers, the dimensions of the cohomology groups of 
M with real coefficients. The Gauss-Bonnet theorem is the statement that 
this is given by the integral of e(M) over M. Thus for dimensions 2 and 4, 


— 1 ab 
XM) = = fea, n=? 
1 
ar / EabedR’ RY, =n=4 (14.142) 
M 


There are many other characteristic classes, such as the Todd class, the 
Hirzebruch polynomial, etc., which are of interest in the context of index 
theorems and anomalies. We do not give them here since we do not discuss 
any application involving them. 

The characteristic classes consist of differential forms built from products 
of two-forms; for a manifold of dimension 2n, the terms in these expansions 
beyond rank 2n differential forms are obviously zero. One of the places where 
the topological properties embodied in the characteristic classes arise natu- 
rally is in the context of index theorems related to zero modes of physically 
interesting operators like the Dirac operator. This will be discussed in Chap- 
ter 17. 
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15 Path Integrals 


15.1 The evolution kernel as a path integral 


In this chapter, we will consider the path integral representation of time- 
evolution in quantum theory. 

Consider a state |i) at time t’. Time-evolution of this state is given by the 
operator e~‘*(t-t) Denoting the time-evolved state by |), we can write 


[b,t) = e*At—-*) 5. 4!) (15.1) 


where H is the Hamiltonian. The scalar product or overlap integral (f |, t) 
gives the amplitude for the time-evolution of the state |i,t’) by H (which 
may contain interactions) to generate |f). Thus the transition amplitude is 
given by (fle~*#(t-)|j). Usually, we consider transitions to happen over an 
infinite interval of time and the preparation of |i) to be in the distant past 
and the measurement or detection of |f) to be in the distant future. This is 
an idealization of the usual case where the measurements are done before the 
interactions under study become effective and after they cease to be effective. 
In this idealization, we can define the S-matrix element by 


Spi = lim lim (fle?) i) (15.2) 


t 
We shall derive the path integral formulae for (15.1) and (15.2), first for a 
quantum mechanical system and then the generalization to field theory. 
Consider a three-dimensional quantum mechanical system, with g;, pj, 1 = 
1, 2,3, being the coordinates and momenta, respectively. The Hamiltonian is 
taken to be i 
H= 5P + V(q) (15.3) 


(We absorb the mass into the momenta.) For this system, we can choose a 
set of g-diagonal states |g) with the orthogonality and completeness relations 


(qq) = 6° (q-’), / |a) d°q (q| =1 (15.4) 


By taking the scalar product of (15.1) with (q| and also using the completeness 
relation, we get 
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(gly, t) = i dq! (gle*#-*) (q/') (q/ |i, t’) (15.5) 
We can write this as 


vat) = f Pa Kata .tHd.t) (15.6) 
We have used the definition of the wave functions, (q|w,t) = W(q,t) and 
(q' |i) = Y(q’). The evolution kernel K(q,t, 9’, t’) is given by 


K(q,t, ght’) = (gle*#¢-*)|q’) (15.7) 


Evidently the S-matrix element is given by 


Sy; = lim lim d°qd°q' UF (q)K(q,t,q',t')Vi(q’) (15.8) 
t—co t’-+—co 
We now write a path integral representation for K(q, t,q’,t’). Divide the 
time-interval (t — t’) into N steps, each of length ¢«, so that Ne = t — t’; 
eventually we shall take « — 0, N — oo, keeping Ne fixed. We label the 
intermediate times by t,, with to = t’, ty = t and denote ty, — ty_1 = €~. We 
can then write 


K(q,t, qt) Sages 9 ce hg) (15.9) 


Inserting the completeness relation of (15.4) between every two factors of 
e tHe we get 


Kitg,t,q',t ) = f ayaa (qlee lgcaigageulen dagen 
-+-(qile"*%"\q’) (15.10) 


In order to evaluate this, we need the matrix element (q;,|e~*”** |q,_1). The 
difficulty in evaluating this is primarily due to the fact that the Hamiltonian 
has both p- and qg-dependent terms, and they are in the exponent of this 
matrix element. We begin by separating them. Using the formula e4+? = 


e4e3e—214.51... we can write 
ete(p? /2+V(q)) aa e-tV (4) p—tep?/2,- 5 [p? /2, V(q)] Pe (15.11) 


The ellipsis refers to terms of order e? which are not displayed. Since € goes 
to zero eventually, one can neglect the term of order e?. The term of order 
e* goes like ie?(pF + Fp)/4, where F = a is the force. Since pe is the 
distance traveled by a particle in time €, we see that this term is roughly e€ 
times the kinetic energy transferred to the particle in time ¢€ by the potential. 
In most situations, where the potential is not too singular, this will go to 
zero as € — 0. This is the case we shall consider. (There could be exceptional 
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situations where the operator Fp + pF has divergent matrix elements and 
this argument will have to be modified.) In the case where we can neglect the 
terms of order e”, we can write 
= _3 Berea 
(qe le*"** |gx—1) = (gele**Y Me-*eeP /2\q,_ 1) + O(€?) 
La = 2 
= eT NeVCAW) (age ***P /2lqy_1) + OR) (15.12) 
The matrix element involving the momentum operator can be evaluated by 
using a complete set of momentum eigenstates. 
aes d* pK Eee 
ale" Iga) = i (anys \lPe) (Pale ene iaee) 


dp, a 2 
= Q@nje° teePe/2 (ay. |e) (Dke|Qk—1) 


= | —£*. ¢ipr(ge—gr—1) p—tenp;,/2 15.13 

(27)8 ( ) 
The momentum of the eigenstates used has been specified with the subscript 
k, since this pertains to the matrix element between gy, qx—1. The integrand 
is oscillatory and the integral can be evaluated by giving € a small imaginary 
part. Equivalently, we can define the integral by 


3 3 
GPK eiPk (Qe —qk-1) p— ten PE /2 = | d° pr ite ae 
€x=1€K 


Qn)? Qn)? 
= looper 
(27€x)? Ee=ier 
' oY abel si (15.14) 
Ti€,)? 


We define a path q(t) by specifying the values of g at various times as follows: 
q(t’) =q =, Wt) =M,°:+, a(tk) = a,°++, 4(t) =n = 4. This is defined 
by a sequence of straight line segments and is only continuous in general. 
The quantity (qx — qx—1)/€~ becomes the velocity or tangent to the path as 
ex — 0. Denoting this by g,, we can write 


p [i (342 — Vax) €x] (15.15) 


etter k—-1) = ————s ex 
(gx \dx—1) (nie)! 


Using this result in (15.10) and taking the limit « — 0 so as to eliminate 
errors due to the neglect of O(e”)-terms, 

N-1 

d? qx 1, 

loner .¢ ) = ali | —<—<——; ome 

(qt.0,t) N-+00,e+0 IN (Qrieps1)2 (2ri(t; —t’))? 


x ef ne L3G — V(qe—-1)] 
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N-1 
= jj a 4k 1 iS(q,t,9',t'] 
= ,_jim Tonia = pee 
N-+00,€-40 pr (27 i€K 41)? (Q7i(tr — t’)) 
(15.16) 


where we have defined 


Sig,t,¢ 4 V(qx)] € 


N 
= 2 beat 
‘s _ — qr-1)? 
k=1 


2, —ten) te (15.17) 


This has the following interpretation. g(t) as given by the values of gq, de- 
fines a path from q’ to q in time (t — t’). As we vary qx = q(t,), we are 
considering different paths. The integration over all q, is thus equivalent 
to summing or integrating over all paths connecting q’ and q in the time- 
interval (t — t’). The weight factor for each path (or the amplitude for each 
path) is exp(7S[q, t, q’, t’]). The quantity S|q, t, q’, t'| is the eleaniial action for 
the specified path connecting (q’,t’) and (q,t), since re = $q° — V(q) is the 
Lagrangian corresponding to the Hamiltonian H = ip? + V(q). The sum 
in (15.17) may be considered as the evaluation of re integral [ dtl by a 
discretization procedure. Define e*°” as the amplitude for the particle to go 
from (q’,t’) to (q,t) along a path P, Sp being the classical action for the 
path. Expression (15.16) then tells us that the total amplitude for a particle 
to go from q’ at time t’ to q at time t is given by the sum (or integral) of 
the individual amplitudes e*S? over all paths. The summation or integration 
over all paths is to be defined by a discretization procedure and involves the 


measure 
N-1 


; d? ay 
Dq| = > lua ——_ 
q| N-co,e30 Ul (Qriepy1)? 
The integration over all paths, because of the oscillatory nature of the ampli- 
tude, has to be defined for imaginary time and the result is to be continued 
to real time. In other words 


ii [Dg]efSiata#] — / Daler Sere] (15.19) 


ttt 


(15.18) 


where Sz is the Euclidean or imaginary time action defined by 
1 {dq : 
Sz — { — 
E fe E (3) a via] (15.20) 


15.2 The Schrodinger equation 


For a point particle in three dimensions, the path integral gives 
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Baqn-i-1 


Si 
lange. Ue yt 15.21 
i=0 [2ai(tn_; = tn—i-1)]? (qo 0) ( ) 


Ne 
Se [ee —i — QN-i-1 M9 


2(tn-i-tn-i1) Vans) (tw-+ ~ ty-+-1)| (15.22) 


Differentiation of (15.21) yields 


OY (qn, tw) 1 (qn —9n-1\" iS 
Otn =| ea eae eheoate) 


3 
— 4— ————__V t 15.23 
2 (tn = tn—1) (qn, Nn) ( ) 
From (15.21), we also get 
OW(qn,tn) _ f (an - qn-1)" is 
———— +e W(qo, t 
qi (ty = tn—1) (Go 0) 
~i(ty tn) (15.24) 
Differentiating once again, 
PY (qn, tn) 3 (qn — 4Nn-1)? ; 
NY, = een) oe t 
V - oe 
erie rie pales 2a EE 
i(tw — tn_1) oe 5 Y(qn, tn) ae aon 
+ (qn — Qn- v5, to) (15.25) 
Comparing (15.23, 15.25), we get, as (tw — tw—1) — 0, 
OU (an, tn) _ ia? 
ee ——— vA ; 


Thus the wave function as defined by the path integral, not surprisingly, 
obeys the Schrédinger equation. 
15.3 Generalization to fields 


In this section, we extend the path integral to field theories. Consider a scalar 
field theory described by the Lagrangian 
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L= 35 [¥? — (Vo)? —m?y?*] — Vy) (15.27) 
We have 
r=¢ (15.28) 
H= | a (nb-£) " 
= i) dx {4n? + 3(V¢d)? + 5m?¢? + V(¢)| (15.29) 


where ¢ is the operator corresponding to y. In discussing this theory in terms 
of a many-particle interpretation, we have used the expansion 


(2) = S—[agux(z) +ahug(c)] (15.30) 
k 


where uz(x) = e~*? /\/2u,V, kx = kot —k-x = uxt—k-@, wp = Vk? + m?. 
The above relation can be inverted as follows: 


an = i f dx [uf(2)Oo0(e) — (Buz) o(2) 
: i Bx ue(e)Boote) 
al =i f ax [ue(2)Oo0(w) ~ (oun) 
=e / da jus(e)So0%()| (15.31) 


The integrals are taken at fixed time. 

We want to define the states |) which are eigenstates of the field operator 
@ at some chosen time. Toward this, consider the free theory first. At t = 0 
define 


> ee 
= er eke x 
ona | Yi 
ili : 
th = | Bx —= e** x(x 15.32 
sa = x(a) (15.32) 
The canonical commutation rules can be written in terms of the Fourier 
components (15.32) as 


[yx, Pi] = 0 
ras) 8 
[yx, m] = ¢ Ok41,0 (15.33) 


The situation is similar to ordinary quantum mechanics, with a countable in- 
finity of dynamical variables yx, 7, y, being the analogue of the coordinates 
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and 7, being the conjugate canonical momenta. Since the y, commute among 
themselves, we can define y-diagonal states as states which diagonalize the 
(x's. Going back to (15.31), we see that 


ap= (te — twp Yr) 

| al = ~i(n_p, + iwny_r) (15.34) 
Diagonalizing yy is equivalent to diagonalizing (a, + al), which is differ- 
ent from diagonalizing the number operator axal. The many-particle states, 
which are also eigenstates of the free Hamiltonian, can be represented in terms 
of wave functions which diagonalize the y,’s. For example, the wave function 
of a state with n, particles of momentum kj, n2 particles of momentum ko, 
etc., can be written as 


eo cote ae 
Dring eeaes n,[¥] = (y|ni, ki, ne, ka, Fa) = (p | Al at van |0) 
(15.35) 


On such a wave function, the 7;’s act as differential operators. From the 
commutation rules (15.33), we see that 


TY [p] = — 


6 
1 15.36 
ies (15.36) 
The ground-state wave function satisfies a,Wo[y| = 0, which may be rewritten 
as 


Ean + xr Doly] = 0 (15.37) 


As in the quantum mechanics of the harmonic oscillator, we can solve this to 
get 


Woly] = N exp [—5 2, we pape] 
= pe exp [—$ f Bad®y v(x) Q(x, y)e(y)| (15.38) 


Qz,y) = 7 ae (2—-y) 


where NV is a normalization constant. YW is an infinite product of the ground- 
state wave functions of harmonic oscillators, one for each k. This result could 
also have been obtained by solving the Schrédinger equation, after writing 
the Hamiltonian in terms of yx, Tx. 

Given the states |y) and the fact that we have an infinite number of 
quantum mechanical systems put together to form the field theory, we can 
immediately obtain 
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Dy, t] = / [dig] ef Stet) wi io! 2] (15.39) 


where 


(15.40) 


N-1 
a 
d lim | 
ache IT UJ ov (2m t€a41) 


The action S(y, t, y’,t’) is the classical action f dtd°x Lie, Ov) evaluated for 
(x,t) with y(a,t’) = y'(xz), v(a,t) = y(@). It is to be interpreted by a 
discretization of the time-interval, splitting it into N intervals, with N — oo 
eventually. The integration over time in defining the action is thus replaced 
by summation. As before, the integral is defined by the Euclidean integral 
appropriately continued. The S-matrix element can be written, up to overall 
normalization, as 


Si= lim i [dp]? [p, Wi [y’, t'] Sete") (15.41) 


coon 


We shall rewrite this in the form of a reduction formula. Consider the process 
where we have incoming particles of momenta k,,k2,---,kn and outgoing 
particles of momenta pj, p2,---,pm. From (15.35) 


Wily] = (vlaj, af, --- 24,10) 
= / da,--- Bry [in (e1)30 a [nits («w)Bon} 
x (y|b(21) (x2) --- d(x) |0) 
= i: I ius. (2) Boxete)| Volvo (15.42) 


where we have used (15.31) and the fact that (y|@ = (y|y. With a similar 
way of writing the wave function for the final state, we find 


- /U ins, (v5)3o5| ins, (21)Ba flavins 
x 9(y1)-+- e(ym)y(a1)-+- olan) eS) 
M pes: a 
= i TI iup, (yj) 00; if} ~ing(e) G(y1, Y2,°°*YM,21,%2,°°° In) 
i 1 


(15.43) 


In the above formula, it is understood that we should take ys oo, 22 > 
—oo. We have also defined the Green’s function 


G(z1,£2,+--, tN) = [ delvstelvola e) (a1) p(a2)-+-p(an) (15.44) 
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The Green’s function involved in (15.43) can be written in terms of the vertex 
function defined as follows: 


G(y1, Yo,°**, YM, V1, 22,°**, EN) 
= feu : daly G(y1, yi) Glan, wy)V (yh, «5 2) 
(15.45) 


where G(y1, y;), etc., are the two-point Green’s functions or propagators. The 
support of the vertex function is roughly over the interaction region. If we 
assume the interactions are negligible in the far past and far future, then we 
may use the fact that eventually y? — oo, x? — —oo to write 


d*k — etk(u—vi) 
(27) k2 — m2 + ie 
3 
= d'k OK 1 ik —¥;) 
(27)3 Qu, 


= X ur(yr ue (yi), yi >yP 


G(y1, 94) SU 


dike) 1 
(27)3 QW. 


= Se uz, (r1)ur(x4), ge <a? (15.46) 
k 


—— etk(t1-24) 


G(x1, 24) = 


We shall simplify (15.43) by using the formulae (15.45, 15.46). The simplifi- 
cation of the integrals involved may done using 


[ Bvugty) Bo Tuewugy) = 45(V) (15.47) 
k 
This result finally leads to 
re = f ay, dee, II [us (Yi) ees (25 ) V (oe) (15.48) 
The propagators satisfy i(O+ m?)G(z, y) = 6 (x — y), so that 


[[#G.i+m’)G(,:--,2N) = V(xi,---, tN) (15.49) 


a 


We can thus write the formula for the S-matrix element as 


=I, (y;)(O 3 +m?) [Tina (ws)( i +m?) Gyn, 5 Ys L1, EN) 
(15.50) 
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where the Green’s function G is given by (15.44). (Of course, G is to be 
evaluated in Euclidean space and then continued to Minkowski space. The 
overall normalization factor must be fixed by requiring (0|S|0) = 1.) 

In such a framework, as in the discussion of functional integrals and renor- 
malization earlier, calculations have to be done by rewriting the action in 
terms of renormalized fields and the renormalization factors Z’s. The action 
is then of the form et 


S= / Z3 [3 (Op)? + (m? — dm?) y?] + Z Ay" (151658) 


where the renormalization constants Z;,Z3,ém? are terms to be fixed by 
requiring the elimination of potentially divergent terms when the short dis- 
tance cut-off goes to zero. We have written out (15.51) with a specific choice 
of the interaction term. More generally, every monomial of the fields requires 
a separate Z-factor. 


15.4 Interpretation of the path integral 


The field configurations y(x) at a fixed time form the configuration space C, 
1.€., 
C = {Set of all y(z) : R* — R} (15.52) 


Thus each point of C is a field configuration (a) for all spatial points x taken 
together. The wave function Y[y] gives a complex number at each point of 
C. As the time label t varies, y(a,t) gives a path in C. Thus (15.39) can be 
interpreted as 
Viyptq}= SY) efSrlel w[y',t'] (15.53) 
all paths P 


where S[v] is the action for a path P in C connecting y’(x) at time t’ to p(x) 
at time t. Basically this result is the Huygens’ principle of wave optics; the 
only difference is that the paths are now in the configuration space C, rather 
than spacetime. 

We shall now turn to some of the topological aspects of the path integral. 
To recapitulate the results so far, the basic path integral can be written as 


¥[Q, t] = / [dQ] ef519-#.9' #1, 19’, £1] (15.54) 


where Q specifies a point in the configuration space C and the integration is 
over all paths connecting the point Q’ at time t’ to the point Q at time t. The 
time-variable ¢ parametrizes the paths. When the configuration space C has 
nontrivial topology, there are many subtleties involved in defining the quan- 
tum theory. Two cases of particular interest are when the fundamental group 
of C, viz., I;(C) is nontrivial, and secondly when there are two-dimensional 
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closed surfaces in C which are not the boundaries of any three-volume along 
with the existence of closed two-forms which are not exact, i.e., H?(C) # 0. 
We shall briefly consider these two possibilities. 


15.5 Nontrivial fundamental group for C 


The simplest example of a nontrivial fundamental group for C is that of 
particle motion on a 2-plane with one point, say, the origin, removed; we 
have discussed this case in the last chapter. J7;(R? — {0}) = Z and the 
nontriviality of a closed loop C’ is measured by the winding number 


ee ie. 
a = —~———dr’ (15.55) 


I i ‘h a (15.56) 


for two open paths P, P’ which are homotopic to each other. 
Given the structure of paths and the winding number, we can generalize 
the path integral as 


Since a is a closed one-form, 


V(2,t) = fies poe) Pads aC) (15.57) 


6 is an arbitrary parameter. A general path from x’ to x may be written as 
P+C, where P is an open path with no noncontractible loops around the 
origin and C, is a loop around the origin with winding number v. Evidently 


/ a=y + ik (15.58) 
P40, P 


We can thus write the path integral (15.57) as 


U(2,t) = 2 esp ti9 ee ci » else tio fate ee eee) 
allP~P allP~P+C; 
- SS y eis ti6 fp ative v(t’) (15.59) 
Vv allP~P+C, 
The summation is over all paths P which are homotopic to P+C, , for each 


value of v. The overall phase exp(i {,, a) does not matter for matrix elements, 
so we have 
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E(cjy=S_ yO vee (15.60) 


V allP~P+C, 


Since v is an integer, we see that W is unchanged under 6 — 6 + 27. The 
parameter 6 may thus be taken to be in the interval {0, 27]. 

In the path integral A appears in S/h in the exponent. The term 0(C) has 
no f in it. This is evident since 6 is an arbitrary parameter between zero and 
27 and v(C) is normalized to be an integer. Thus the 6v-term has no classical 
effect. We must regard 6 as an extra parameter (which does not appear in 
the classical action) which arises in quantizing theories with I,(C) 4 0. If 
we want to mimic the effect of 6v by a term in the action, we can do so by 
writing 


6 oe 


The extra “topological” term, we notice, has an explicit A in it. (It is also 
singular at + = 0; thus the point z = 0 has to be removed for this to make 
sense. ) 

In general, I7;(C) can be something other than Z. The general rule is 
obtained as follows. In addition to e**, include a factor K(C) for curves C 
where K has the following properties: 


1) K(C+6C) = K(C) 
2) K(C,)K(C2) = K(C, + C2) 
3) K(—C) = K*(C) 


The first property gives the invariance of K(C) under small deformations 
of the curve C; or in other words, K(C) should be a topological invariant of 
the path. The second property gives the composition law for the paths, while 
the third tells us that A(C) is a phase which changes sign under change of 
orientation. All these properties follow easily from the discussion so far. In 
total these properties say that K(C) is a one-dimensional unitary represen- 
tation of the group /7;(C). K(C) will possibly involve new parameters (like 
6); such parameters are new parameters which arise in quantizing the theory 
with the given nontrivial [7,(C). They are to be treated as extra coupling 
constants in addition to whatever coupling constants appear in the classical 
Lagrangian. The values of these parameters will take in the quantum theory 
have to be determined experimentally. 

The quantization rule can thus be restated as follows: 


v(Q,t) = / [dQ] e249") K(C) W(Q',t') (15.62) 


where (’s denote the coordinates of the configuration space. Some examples 
where this kind of topological feature is important are: 


1. Charged particle dynamics in the presence of a magnetic vortex. 
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2. Particles of fractional spin and statistics in two spatial dimensions. In 
this case, 9 can be taken to be in the interval [—7, 7] with 6 = 0 giving 
bosons and @ = 7 or (—7) giving fermions. The values in between give 
“anyons” or particles of any statistics. 

3. Quantum chromodynamics. The configuration space C is not simply con- 
nected and in fact IJ,(C) = Z. This leads to an additional term @v/A] 
in the functional integral for chromodynamics. v[A], which measures the 
nontriviality of paths in C, is the instanton number. 


15.6 The case of H?(C) 4 0 


We now turn to the second type of quantization problem which could 
arise when the configuration space has noncontractible two-surfaces. The 
quintessential example of this is the motion of a particle on a two-sphere. 
The path integral gives 


V(z,t) = i! [dc] W(P, 2,2) Ha, 1) 


W(P, 2,2’) = eS? (7) (15.63) 


We can think of e*5? as follows. For every path P between 2’ and z, e*°? 


gives a complex number of unit modulus; i.e., 
W(P, 2,2’) = e?@*) : set of paths > U(1) (15.64) 


The path integral gives the summation over all paths with this factor e** as 
the weight factor for each path in the summation. The new idea we explore 
is that we can associate weight factors for the path integral with surfaces 
rather than directly with paths. One way to do this is as follows. Choose a 
particular path, referred to as standard path from now on, between the two 
points x’ and z; call this Po. The path of interest is denoted by P. Now P— Po 
forms a closed loop and we consider the surface 3’ whose boundary is P— Po. 
(On the two-sphere such a surface always exists; more generally, the following 
construction holds in situations where there is such a surface. When P — Po 
is not the boundary of a surface, our construction has to be modified.) The 
coordinates x“(t) which give the paths can be generalized to x“(o, t) = x*(€) 
where €* = (o,t) parametrize the surface Y. The area element of the surface 
= can be written as 40;240,2"e7, i,j = 1,2, 1 = t,2 = o. The weight 
factor associated with the path P is now generalized as 


W(P, x, 2’) = eiSP(#,2’) el (2) 
P(Z) = kf ae 5 Buvdia" Oj" (15.65) 


where B,» is a function of x“, ie., Buy = By, (x) and is antisymmetric, i.e., 
Buy = —Byy. 
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There are many consistency requirements on such a term which we shall 
now examine. First of all, we may think of the extra term as an addition to 
the action. In defining the weight factor (15.65), we need the surface 2’ or 
z(a,t). 2“(t) defines the path, but the variable o has no meaning physically. 
The first property of the term (15.65) should be that the o-dependence of x” 
does not affect the physics. Alternatively, note that there are many surfaces 
which have P — Py as the boundary. Physics should ‘né6t depend on which 
surface we choose to define (15.65). The choice of different surfaces can be 
made to correspond to different functional dependences of z* on o and thus 
we must require that [’(’) be invariant under small deformations of »' or 
small variations of 2“(£). We have 


oy al dé [Ba Burd dj2d;2" 
2/5 
+ ByvO;(da")0;x" 9 
+ BuvOix"d;(5x2” ec? | 
k ” 
= i PE [Ox Buy + OnBra + 8, Boy] O2"Oj2" 452° 


+k ¢ dé; Buy O;x" 4) ba" ; (15.66) 
op» 


The first term is sensitive to the o-dependence of x (or choice of surface 
+’). We can obtain physics independent of the o-dependence of x or the 
choice of 4’, at least for small variations of the surface, if we require 


OxBuy + Op Bra + Op Boy = 0 (15.67) 


The second term depends only on the path P — Po, which is 0X’. Now 624 
is zero for the standard path Po since Pp is fixed once and for all. Thus 6I° 
depends only on dz” along P or on infinitesimal variations of P, provided 
(15.67) is satisfied. It is the only requirement for small variations of the path. 
We shall rewrite 6° in a more useful way shortly, but before we do so, a couple 
of other properties have to be discussed. First of all, consider solutions of the 
condition (15.67). One set of solutions is obviously given by 


where C;,, is some vector potential. In this case, we have 
ul 
2 
=k f &e Zic,aj2"€ 
me. a yOjx" 
= kg dé; CLO;2" 4 

as 


r=2 ‘i d?€ (0,C,)0;2"0;2" € 
a; 
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=k| fac, af dt Cy, | (15.69) 


The contribution of the standard path is a common phase factor for all wave 
functions since it is common for all paths. Therefore, we can drop it from 
the path integral. Thus we get W(P, 2,2’) = e*5P (*,2") for each path P, with 
S =S+k f dt C2". We simply have another term in S and all this discussion 
about »' is not relevant. The kind of topological term we are discussing 
becomes relevant only for B,, which satisfies (15.67) but which cannot be 
written as a curl of a vector as in (15.68), i.e., 


8a Buv + Op Bra + Boy = 0 (15.70) 
Buv $ OnCy — OC (15.71) 


Notice that B,, is antisymmetric in y,v by its definition (15.65); it is the 
component-version of a two-form on the configuration space, B = 5 By dct A 
dx”. The condition (15.70) is equivalent to dB = 0 and (15.71) to B 4 dC 
for some one-form C’. Thus the B’s we are interested in are closed two-forms 
on C which are not exact, i.e., elements of H?(C,R). 

Consider as an example the monopole field 


Ww An |al8 An 
1 

B= ue, Suva da" \ dx” (15.72) 
8 or? 


We then find 
dB = 5 (Oa Bw + O,Bre + OBay)da*dx’ dx” 
- stnvac@ (O° Bia) dx*detday 


MP el Fy,f i fait B y 
yg cde See So 7a] dx*da"da” 
us T 
= 0, for r #0 (15.73) 


On R3 — {0}, dB is zero. Integrating B over a two-sphere surrounding the 
origin 
B=1 (15.74) 
S2 

so that B 4 dC. The two-form (15.72) furnishes an example for R® — {0}. 
For the path integral for the motion of a charged particle in the background 
of a monopole we include a term exp(iI'(2)). 

This particular solution we have found is valid in R® — {0}; this space 
has noncontractible two-surfaces and indeed, we have used such a surface in 
arriving at the argument for the nontriviality of (15.72). 
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Consider now the case of R? with two points removed from it. In this case, 
we have different types of closed noncontractible surfaces. We can consider a 
surface S$; surrounding one of the removed points, or a surface Sz surrounding 
the other removed point. Evidently, these cannot be deformed into each other. 
We can also consider a single surface surrounding both points, but this surface 
can be decomposed, up to smooth deformations, into S; + S. In this case, 
we say that S; and S2 are generators of noncontractible two-surfaces in the 
sense that any noncontractible two-surface can be decomposed into copies 
of S; and S_. More generally, we need H2(C) which is the set of all closed 
two-surfaces which are themselves not the boundaries of any three-volume. 
H2(C) is called the second homology group of C. In the example of R? — {0}, 
H2(C) = Z and there is one generator, namely, the unit sphere surrounding 
the origin. For the example of R° with two points removed, H2 = Z+ Z, and 
we have two generators, S; and S2. In general, we can define a B,,, for each 
generator as proportional to the area element of the generating surface. Such 
a B,, can be normalized by requiring ¢, By, dS” = 1. 

We now go back to the path integral and consider the specification of X’ 
again. In general, there are several surfaces which have P — Pp as the bound- 
ary. Consider two such surfaces ©’ and 5”, where one is a small deformation 
of the other. Since they have a common boundary, we have a three-volume V 
in between the surfaces, such that OV = ©’ — X&”. Then by Stokes’ theorem, 
we have 


I's —Iyy =k dé © Bi OiotO,0% =k i B 
pay 8 me 
a | dB 
V 
= 0, by (15.70) 
(15.75) 


Thus it does not matter which surface we choose. This argument works only 
for 3’, &” such that 3’ — &” is the boundary of volume V. If H2(C) is not 
trivial, there are cases where 1 — 4%” is not the boundary of any volume. 
For example, for R* — {0}, we can use Y and SZ” such that Y — D’ is the 
two-sphere surrounding the origin. This is noncontractible (since the origin 
has been removed) and further there is no volume V such that OV = 5 — Dd’. 
In this case 


[y -—Ty =k B 
S2 
i (15.76) 
where we have used the result (15.74). In the path integral, we thus get 


els — e'Tz" eth (15.77) 


15.6 The case of H7(C) #0 357 


We can get results independent of the surface if we impose the further con- 
dition e* = 1 or k = 2rn, n€ Z. Thisisa topological quantization rule. In 
the context of the charged particle in a monopole field, this is again the Dirac 
quantization rule. With this condition, we may summarize the results as fol- 
lows. If the configuration space C has noncontractible two-surfaces, (and if 
every closed curve is the boundary of some surface), it is possible to generalize 
the path integral as 


U(Q,t) = / [dQ (Qt #+T> WCQ’, 1) (15.78) 


where %’ is a surface whose boundary is the path P and a standard path 
—Po. I's is given by 


1 ann f B (15.79) 
oa 
where 7 is an integer and B is a two-form on C with 
Cp = 0 (15.80) 
B#dC (15.81) 


and B is normalized, i.e., 
i esa (15.82) 
iB 


where S is a generator of the group of H2(C). A term like (15.79) with the 
conditions (15.80, 15.81) is called a Wess-Zumino term. All conditions are 
such that no h appears anywhere and this term also has no classical effect. 
(Generally, nontrivial H2(C), rather than H7(C), is more relevant to the kind 
of topological feature of the path integral we are considering. We can specify 
weights for paths in terms of surfaces whose boundaries they are. We have 
chosen to write these weights in terms of integrals of a differential form, 
thereby bringing in the cohomology group. Such a differential form is obtained 
in all the problems we are considering and so this is adequate for our purpose.) 

The Wess-Zumino term occurs and makes a difference in a number of 
physical situations. Examples are: 


1. Motion of a charged particle in a monopole background 

2. Low-energy behavior of quantum chromodynamics, in particular in in- 
terpreting solitons of meson fields as baryons. (Such solitons are called 
skyrmions. ) 

3. Anomalous gauge theories 

4. Boson-fermion equivalence in (1+1) dimensions. 
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16 The Configuration Space in Nonabelian 
Gauge Theory 


16.1 The configuration space 


We now apply some of the general analysis of the previous chapters to the 
specific case of the configuration space of an unbroken nonabelian gauge 
theory. Quantum chromodynamics (QCD) is, of course, the most interesting 
theory of this type. We will see that 17,(C) = Z for QCD. As a result we get 
a new parameter 6, which is needed to characterize QCD. 

Consider a pure gauge theory. We use Ap = O gauge to simplify the 
analysis. The theory is thus described by potentials A;, i = 1,2,3, which 
are Lie algebra valued. They can be taken to be antihermitian matrices, 
A; = —iA?t*, where t® are hermitian matrices which form a basis of the Lie 
algebra of the gauge group. For simplicity we shall consider an SU(2) gauge 
theory. Gauge transformations act on A; by 


A; ~ Al =9 Aig —Og 97 (16.1) 


where g(x) : R? + SU(2). One cannot consider g(a) which depend on time, 
since such transformations would change the gauge choice Ag = 0. Thus, once 
we choose Ag = 0, the transformations above represent the residual gauge 
freedom of the theory. The spaces of relevance to us are 


A {sme of gauge potentials at (16.2) 
c.— {sone of gauge transformations g(x) : R? > SU(2) 


such that g > 1 as |x| > ~| (16.3) 


(This is similar to what we discussed in Chapter 10; there we defined and 
used similar spaces for potentials in four-dimensional Euclidean space, for 
the purpose of the functional integral. Here the potentials and gauge trans- 
formations are at a fixed time and are functions of the spatial coordinates. 
Even though we use the same notation, the context should make clear what 
is meant.) 
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As we have discussed before, transformations g(x) which go to a constant 

element gx. # 1 act as a Noether symmetry. The states fall into unitary 
irreducible representations of such transformations, which are isomorphic to 
the gauge group SU(2), up to G,-transformations. The true gauge freedom is 
only G,. The configuration space of the theory is thus C = A/G,. In analyzing 
the structure of this space we need information about A and G,. 
(Note: The action of G, which is the set of gauge transformations which go 
to a constant g.. #1, on A has fixed points; e.g., A; = 0 is left invariant by 
the action of constant g’s. Thus A/G will have singularities. Mathematically, 
to avoid such singularities, one must use A/G,. Our discussion of Chapter 10 
shows that this is also the relevant quantity physically.) 


Topology of A 


A is an affine space, i.e., any potential A; can be written as Al) +h; where 
A is a fixed potential and h; is an arbitrary Lie algebra valued vector field. 
Equivalently, any two points in A can be connected by a straight line. If 
AMY) and Av?) are two points in A, i.e., two gauge potentials, the sequence of 
configurations 

Aint) = AM (1 —T)+ 7A) (16.4) 


where 0 < 7 < 1 provides a straight-line interpolation between A‘ ) and AY. 
A thus looks like an infinite-dimensional Euclidean space. Topologically it is 
a rather trivial space. In particular [Jp(A) = 0, 1,(A) = 0. 


Topology of G, 


G,, is made up of mappings g(x) : R® — SU(2) with the condition g(a) > 
1 as |x| — oo. Consider an arbitrary element of SU(2). The group SU (2) can 
be thought of as the group of (2 x 2) unitary matrices of unit determinant. 
We can parametrize an element of SU(2) as 


9 = $0 + idioi (16.5) 


where o; are the Pauli matrices and the condition gi = g~! implies that 
go, $i are real. The condition det g = 1 requires 


$5 + Des ge? =1 ; (16.6) 


The group SU(2) is thus topologically a three-sphere. The ¢,(xz), p = 
0, 1,2,3, thus give a mapping R°? — S%. Further we have g — 1 as |a| — oo. 
For the case of such maps, we can think of space R? itself as being a three- 
sphere. Explicitly this can be realized as follows. We define a mapping 


x? = 1 22; 
a? +1’ oe ea = 
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Evidently y2 + >; y? = 1; the y’s define a three-sphere. (16.7) thus gives a 
description of R® as a three-sphere, spatial infinity corresponding to |a| — oo 
being mapped to the pole yo = 1, y; = 0. Now, given an SU(2)-valued 
function on S%, i.e., given g(y) where PL. a = 1, we can use (16.7) to write it 
as a function on R® with g — go, goo not dependent on angles as |a| — oo. In 
particular, choosing goo. = 1 requires g(1,0,0,0) = 1. We are thus concerned 
with maps S* — S%, where the first S* represents space R® via the map 
(16.7) and the second S* is SU(2). The classes of such maps will be given by 
TT3(S3) = Z. 

We shall analyze these maps by defining a winding number Q. Consider 
¢ : S3 — $3. We want to determine how many times the target sphere S° 
is covered by the map ¢(x) as we cover the spatial S* once. The volume 
element for the image generated by ¢“(x) is given by 


dS,,€4”°F = 0:60; Ong" &* Ba (16.8) 
or 
dS, = sSuvase OG"; GOS an 
= gd) | (16.9) 
where 
d2®) = aeuvapet* gH O:6” 0;¢° OG dx (16.10) 


Since the volume of S? is 27”, we get for the winding number 


Q{g] = 5-3 {volume traced out by ¢4(x)} 


~ 
~ [272 


/ Br €poapes* o"0:¢"0;6*O.07 (16.11) 


This can also be written directly in terms of g(x) as 


1 - a 7 ' 
Ol aes pes Tr (90:9 9~*0;9 9‘ Ong) €* 
I * 
=Gee [xo 1dg)° (16.12) 


We have already seen this formula from our discussion of the cohomology 
and homotopy of Lie groups. The arguments given here will provide a more 
explicit realization. 

We now show some important properties of Q[g]. 


a) Q is invariant under smooth deformations of ¢. 


We have seen this in terms of g in Chapter 14; we will now show the 
same result in terms of ¢. Consider deformations ¢4(x) — g(x) + d¢*(z2), 
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which preserve the boundary condition ¢4(x) — (1,0,0,0) as |a| — oo. This 
requires 6¢4 — 0 as |x| — oo. Further 6¢“¢, = 0, since we have to preserve 
ob, = 1. For the change in Q to first order in 6¢” we find 


Qld + 54) - QI6) = a5 | ox [Soa6" nad nag? 
a Adg* de? 
+ pdb” A d(5¢*) A do? 
+ abtdd? Add* A d(59")| euvap 
= = il dx [5o*dd” Add A dé” |ewas 
(16.13) 


where we have done partial integrations on terms involving d(é¢). The surface 
terms are zero, since 6¢ — 0 as |x| — oo. Now, since 6¢- ¢ = 0, we have 
od = e+¥°8h, wag for some arbitrary antisymmetric tensor wag. Using this 
in the above equation, 


Qlb +56] - QI] = a5 | Suk bowed" dd” Ade” 
0 (16.14) 


since all terms, upon expanding Oe , involve one power of 6¢- ¢. Thus Q is 


invariant under smooth deformations of ¢“(x) which preserve ¢* — (1,0,0,0) 
as |a| — oo. 


b) Q[gi92] = Q[91] + Q[g2]. 
Consider Q[gig2]. We shall use (16.12) to evaluate this. We have 


(9192) *d(g1g2) = 93 ' (9; 'do1)92 + 92 ‘doe 
= 9,'(A+ B)go (16.15) 


where A = g; 1dg. and B = dgo 92 1 Using this decomposition 


Q[9192] = af Tr(A + B)(A+ B)(A+B) 


2472 
1 


yi i [TrA® +TrB® + (34?B + 3B7A)] 


= Qloi] + Qlaal - = / Tr(-dA B+ AaB) 


= Q[91] + Q[ge] + _ / d(TrAB) (16.16) 
= Q[9)+Q[g2] (16.17) 
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(The wedge products or antisymmetrized products are left understood in this 
and some of the following equations.) We see that the winding numbers add 
when we take products of the group-valued functions g; and gg. The possible 
surface contribution in passing from (16.16) to (16.17) is zero, because the 
antisymmetric product of A and B falls off sufficiently fast as |a| — oo. 

We now consider an example of a configuration for which Q = 1. This is 
given by a 
z?—-1 . 2z* 
eal ” ee 
which is equivalent to ¢“(x) = y#. The @’s form a three-sphere; so do the 
y’s. g4 = y gives one covering of the S°, which is SU(2), when y’s cover 
the S° corresponding to space (or equivalently R°). Thus Q = 1 for this con- 
figuration. This may also be verified directly from the integral (16.12). gi (x) 
is a smooth configuration with g; — 1 as |x| — oo. Now the configuration 
g(x) = 1 everywhere evidently has Q = 0. Since we have Q = 0 for g = 1 and 
Q = 1 for g = g)(x) and Q is invariant under smooth deformations, it is clear 
that g(x) cannot be smoothly deformed to the identity everywhere. Further 
we can consider g(x) = gi(x)gi(x), which has Q = 2 by (16.17). Also we have 
g = 9\(z), which has Q = —1, which follows from Q[gtg| = Q{gt] + Q[g] and 
Q[gig| = Q[1] = 0. The classes of maps g(r) : S? — S$? are now clear. We 
can write G, as the sum of different components, each of which is connected 
and is characterized by the winding number Q); i.e., 


gi(x) = (16.18) 


+0o 


Gi= >> @S.q (16.19) 


Q=—oo 


where G.o consists of all maps smoothly deformable to 1 everywhere; G.1 
consists of all maps smoothly deformable to gi(x) everywhere; and more gen- 
erally, G.g consists of all maps smoothly deformable to the Q-fold product of 
gi(z)’s. Gx—g is similarly defined using gi (az). Gag and Gxg: are disconnected 
from each other for Q # Q’, since if they are connected, (gi)° and (91)? 
should be deformable to each other and this is impossible since Q 4 Q’. 
Q’s add upon taking the product of two maps as in (16.17) and hence this 
structure is isomorphic to the additive group of integers Z. 
We can now analyze the structure of A/G,. Consider the line in A given 
by 
A;(x,T) = A;(z)(1— 7) + APT (16.20) 


for 0 < 7 < 1 or more generally 
A;(x,T) with A;,(z,0) = A;(z) 
A;(x,1) = AM (zx) (16.21) 


where A?" is the gauge transform of A; by gi(x) as in (16.1). This is an open 
path in A. But since A$’ is the gauge transform of A;, both configurations 
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A; and A‘? represent the same point in C = A/G,. Thus A;(x,7) describes 
a closed loop in C. The question is whether this loop is contractible. If the 
loop is contractible, we can deform the trajectory to a curve purely along 
the gauge flow directions which would then connect g = 1 to g = gi (x). This 
would imply that 9;(x) is smoothly deformable to the identity. But we know 
that this is impossible from our discussion of the structure of G,.. In turn this 
implies that A;(z,7) of (16.21) is a noncontractible loop. Thus I(C) 4 0. 
Further, we can repeat the argument with go = gigi, or with g3, 94, etc., 
and also with g!. With the composition rule Q[gg’] = Q[g] + Q[g’], we see 
immediately that 

IT, (C) = Ih( A/G.) = Z (16.22) 


Another way to see this is to use the exact homotopy sequence. From 
(16.19), G, has connected components labeled by Q, so that IHpo(G,.| = Z. 
Then 

— Il;[A] — Ih[A/G.] + Ho[G.] — Ho[A] (16.23) 


gives 17;|A/G.] = Z, since 1,|A] = Ho[A] = 0. 


16.2 The path integral in QCD 


From our general discussion of the path integral and the result (16.22) above, 
it follows that QCD has a parameter 0 < @ < 27 in addition to the gauge 
coupling constant. The path integral is given by 


WA, ¢] = / dpfA/Ge] eSA4) eA) DLA’ 2") (16.24) 


We still have to determine v[A]. As in our general discussion, v{A] should be 
invariant under small changes of the path and sensitive only to the “bound- 
ary values” A;(z) and A¥(x), in fact only to g. Further it must be Lorentz 
invariant with the parameter T now being the time variable t, since paths in 
the path integral are parametrized by time. We may thus expect 


a= / de 8,K"(A) (16.25) 
Carrying out the time-integration will give 
(A, A’) = i da K°(A) — i Ba K°(A’) (16.26) 
t=co t=—oo 


which is clearly insensitive to the path in between. Since Q[g] gives the wind- 
ing number for closed curves, we should have 


[ee [K°4") — KA] = Ql = -zey [Od 0827) 
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or 
1 
K°(A9) — K°(A) = ~ 573 Ir(g*dg)° + (total derivative in x) (16.28) 
Now, A involves gAg~' — dg g~!; further dA9 has (dg g~1)?. Thus in 
order to obtain (g~'dg)*, we need K°(A) to be made up of e/* A;0; A, ~ AdA 
and €7* A; A; A, ~ A®. We thus consider 
K°(A) = Tr [@AF + dA? (16.29) 
which gives 
K°(A9) — K°(A) = Tr [-avF — 3bvA? + 3bv7A — bv’ ] (16.30) 


where v = g~'dg. The last term with v? gives the required form for Q[g] and 
identifies b as (1/2477). The terms involving A must vanish or combine as a 
total derivative. This identifies a = —3b. We thus get 


1 1 


1 2 a 
= — yaa Tr(Aidj Ar + gAiAyAn)e ake (16.31) 
The covariant version is evidently 
K¥(A) = aD AvyOnAg + AA Ag| (16.32) 


K°(A) is called the (three-dimensional) Chern-Simons term. From (16.32) we 


find 
1 


327? 
Thus the path integral for QCD becomes 


0,,K"(A) =— tvOB Ty (Fy Fog] (16.33) 


5 Het? : t 
WA, t] = " Sac AAT ay ee a (16.34) 


For Green’s functions, one is interested in t — oo, t! — —oo. The path 
integral, which becomes the generating functional for Green’s functions, is 
then given by 


Z(J| = / dulA/Gu] eS(A+r1A) of de apa” (16.35) 


where we have included a source term e if A-J +o facilitate the calculations of 
arbitrary correlators. v[A] is given by 


1 


v[ Al} = ~~ 30R2 


/ d‘x e#¥°P Ty [F.,, Fag] (16.36) 
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v[A] with integration over all time (and space) is known as the instanton 
number. The density 


ee 
3272 


is the second Chern class of the gauge field A; —v[A] is the second Chern 
number. — 

An important property of c#”°°Tr[F.,Faglis that, because of the €- 
symbol, it has only one time index and three spatial indices. As a result, 
it is odd under parity P and time-reversal T. Its presence in the path inte- 
gral thus leads to T-violation by strong interactions if 6 # 0. (Our discussion 
has been for SU(2); the gauge group has to be extended to SU(3) for the 
case of QCD, but all the above analysis goes through for all compact gauge 
groups since they always contain SU(2)-subgroups and J7o[G.] = [13(G) = Z 
for all compact Lie groups, except for SO(4) in which case it is Z x Z.) One 
consequence of this T-violation is that the neutron can have a static elec- 
tric dipole moment. From experimental measurements of the neutron electric 
dipole moment, we get the experimental limit on 6 as 


1 
C2 eels [Fy Fag qa NF \ F) (16.37) 


us 


|4| < 10~° (16.38) 


This abnormally small value of @ has been a puzzle, referred to as the strong 
C'P-problem. It suggests that 6 = 0 by virtue of some symmetry. Attempts 
to understand this small value of 9 has led to the concept of axions and 
associated interesting ideas. 

Another important consequence of the presence of the e+” is that the 
Euclidean continuation of v[A] does not pick up any factors of i = /—1. 


d’a MPR Fag ~ dtdea e* Fy: Fj, = drtd’a eo" Fy:Fj, (16.39) 


where x* = it = ir®. The formula (16.35) for Z[J] can be written, in terms 
of a Euclidean functional integral, as 


Z|J] = zeta 


z4=in? A,g=—iAo 


Zp[J] = / dp[A/Gu] eSB A+i0-14] gi f AJ (16.40) 


16.3 Instantons 


We have shown that IJ,(C) = Z. There are noncontractible paths in A/G... 
An example of such a path is 

1 e*4 
ere 7 ae a | 


A;(x, 24) = A;(a) Ali (a) (16.41) 
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which has the property that A;(a,24) — A;(x) as x24 — —oo and A;(a,24) > 
A§* (a) as x4 — +00. By construction v[A;(x, 24)| = 1. The contribution of 
such a path to the path integral is given by exp(—S[A(z)] + 10). Of course 
there are an infinity of paths which can be deformed to (16.41). In sum- 
ming over such paths, the dominant contribution will come from paths with 
the least Euclidean action. We thus expect certain extremal paths with the 
properties 


Ly Om =0, ie, DuF"” =0 for pure Yang-Mills theory 

2. Ao = Ag=0 (our gauge choice) 

3. A;(@,rz4) +0 astg—+—00, A;(x,24) - —Oig; gj’ as 24 + 00 

4. SplAj(x)] < co (otherwise it is irrelevant for the path integral since 
e~* —+ 0 for S > 00) 


In condition 3, we have considered, for simplicity, the starting config- 
uration as A;(x) = 0. Apart from our gauge choice, these conditions are 
equivalent to 


D,F*” =0 
Vaya 1 
SpA] < 00 (16.42) 


Configurations A,,(x) satisfying (16.42) are called instantons. Configurations 

with v[A] = —1 will give anti-instantons. A,(x)’s with v[A] =integer, (A 

—1,0,1) give multi-instanton or anti-instanton configurations appropriately. 
The one-instanton configuration can be written as 


en 
A, (x) = OEE Cag Ow 


Zatir-@  eyrh 
a/ 42 a/ x2 


where x? = x," and e,, = (1,i7;). A is a scale factor giving the “size” of the 
instanton. Since our arguments leading to v[A] were presented in the gauge 
Ap = Aa = 0, we note that the configuration (16.43) can be transformed into 
this gauge. It then looks like 


= (16.43) 


2 

2 ad ee, ew 16.44 
A, =U (= 5 Vie ox U ; (16.44) 
U = exp (ir - Zp] (16.45) 

|az| a4 wT 
= ae i ——— } + — 16.46 
p ra arctan Tae De 9 ( ) 
SOP Ses (16.47) 
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(It is worth noting at this stage that the formulae for the gauge potentials 
do have singularities in general. To get a nonsingular description, one has to 
define different expressions for A on different coordinate patches, with the 
gauge transformations as transition functions on the intersections. Gauge in- 
variant quantities, such as Tr(F,,, Fog) and its integral, are nonsingular and 
well defined for the instanton; however, if we choose to express the instan- 
ton number in terms of K°(A) which is not gauge invariant, a patchwise 
description is needed.) 

Instanton configurations obey a self-duality (F,, = Fyv) or antiself- 
duality (F,, = —Fv) condition which is best seen using the so-called Bogo- 
mol’nyi inequality. The dual of F'll,, is given as F wy = 5Euvapl, ag: We start 
with the inequality 


i d‘c (Ft, —F2,)? >0 (16.48) 


v 


Expanding out and using the definition of the instanton number v[A] from 
(16.36), we find for the Euclidean Yang-Mills action 


A a, a 
SE aes d zL be at 
2 
> 14] , (16.49) 


This is the Bogomol’nyi inequality. (If the instanton number is negative, we 
can use the inequality (Fi, + F 4,)* = 0 to obtain the bound given above.) 
This shows that the solutions for vy = 1 which minimize the action saturating 
this bound must be self-dual. Notice that if they are self-dual or antiself- 
dual, then the equations we have to solve are first order in the derivatives 
and so they are simpler. The solutions given in (16.43, 16.47) obey the self- 
duality property. In fact, the Bogomol’nyi inequality shows that the solutions 
for any v ~ 0 must be either self-dual or antiself-dual. Of course, there are 
many configurations with v ~¢ 0 which are neither self-dual or antiself-dual, 
which are not solutions of the equations of motion. Also not every self-dual or 
antiself-dual configuration has to minimize the action; there are many which 
are just extrema of the action. 
The one-instanton configuration of (16.43) can also be written out as 


Nave 
At = Lb : 
be (x2 + 2) (16.50) 
where the three 77, is given by 
tae = —O70v4 ae 07 Ou4 ae Pas eee (16.51) 


Niv is known as the ’t Hooft tensor. 
Instanton solutions can be used to carry out a semiclassical evaluation 
of the functional integral. This will require expanding the action around the 


s 
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instanton solution by writing A = Ains:(x) + (x), where €(x) is a small 
fluctuation in the field; keeping terms in the action up to the quadratic order 
in €, one can then do a Gaussian integral to get Z[J]. This should be done, 
not with just one instanton; one should sum over all classical solutions, which 
include multi-instanton solutions. In practice, the calculation is limited to a 
dilute gas of far separated instantons and anti-instantons. Some features of 
the theory are exposed by this calculation, but this is still within the small 
coupling regime. 


16.4 Fermions on an instanton background and an index 
theorem 


Consider a four-dimensional! space M with Eucliden signature for the metric. 
We want to analyze the eigenmodes of the Dirac operator. The spinors on 
the manifold can be separated into components of left and right chirality. 


Y= 3(1+75) ¥ + 301-15) 
$i +R (16.52) 


We also define 7- D = D — Di where 


D=(y p) (+48) 
Dt = (+45) (y . Dytex (—y p)(* =) (16.53) 


We denote the set of normalizable spinor modes of left chirality on the man- 
ifold as E,; Er will denote the set of normalizable modes of right chirality. 
Since +, changes the chirality, we have the mappings 


Ill 


D> Ex 4 ER 
Di: Er, — E;, (16.54) 


For example, if ¢ belongs to E_, 


¥5(D¢) = ys(y- D)4(1+45)¢ = —(7- D)§(1+5)¢ 
=p (16.55) 


showing that Dd@ belongs to Er with y; = —1. This also means that if 
D¢ # 0, we have a 1 — 1 mapping from Ey, to ER. Likewise, Di provides a 
1—1 mapping Er — E;. Thus we have the result that the nonzero modes 
are paired up. The zero modes or the kernels of the two operators need not 
match. We define 


index(y-D) = dim(ker D) ~— dim(ker D') (16.56) 
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We can express this in another way. We cannot define the eigenmodes of 
D or Dt since these connect different spaces. But D}D : E, — Ey, and 
DDt : Er — Ep. If ¢ is an eigenfunction of D'D with eigenvalue 4, i.e., 
Dt D¢ = Ad, we get D D'D¢ = AD¢, showing that D¢ is an eigenfunction 
of DD‘ with the same eigenvalue. All nonzero modes are exactly paired up. 
Further, D'D and DD? are elliptic operators; i.e., the term with the highest 
number of derivatives is elliptic. Thus we can also define 


index(y- D) = dim(ker D'D) — dim(ker DD‘) 
_ {number of zero\ —_ / number of zero (16.57) 
~ \ modes of Dt D modes of DD? 


We assume that M is a compact orientable Riemannian manifold with 
OM = @. In our case of flat Euclidean space, this is equivalent to requir- 
ing that F,,, vanish as |x| — oo. In this case, the operators D'D and DD? 
are Fredholm operators. They are continuous operators and have finite mul- 
tiplicity of eigenvalues. Thus the index is finite. 

The index can also be written as 


index(y - D) = Tr eee arr ese? ae 
L 


= Tr [ys exp ((y 1 a are 
Te (16.58) 


ER 


The trace is over the infinite-dimensional spaces of eigenmodes and over the 
Dirac matrices. The nonzero eigenvalues and eigenmodes are paired and so 
will cancel out in the sum. For the zero eigenvalues, it does not matter what 
M? is, so we can calculate this as M? — oo. We then find 


index(y-D) = lim Tr fase") 
—+OoO 
M2-300,y 2 


4 
= faye [lule) ple 147 2] 


d*p 
= [ata Te lrsem (iAP /M? + BFert? (MY) 
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= _— ‘i d*x Tr (FuvFw) 
= (16.59) 


(Here we use the convention €4123 = €0123 = 1 and Tr(y°y#y"7%78) = 
Aev28 |) We have thus shown that 


dim(ker D) — dim(ker Dt) = —y[A] (16.60) 


16.4 Fermions and index theorem 371 


This is the Atiyah-Singer index theorem applied to the special case of the 
spin complex in four dimensions. The general index theorem gives an inte- 
gral formula for the index of any elliptic complex in any even number of 
dimensions. 

The index is a topological invariant, i.e., it is invariant under small changes 
of the potential A,,. If we write the index as a surface integral given by 


v[A] = ¢ dd, Ke 


Kt = -gatk (4.0043 + FA, AsAs sil (16.61) 


we can easily check that 


pee) Se) — ia $ d33,, T(E, Fag etv? 
= [A] (16.62) 


using the fact that Fug — 0 as |z| — oo. ¢ dU, K* is not zero by this 
argument, since it is not gauge-invariant and A, does not have to be zero, 
but can be a pure gauge potential as |z| — oo. Thus the situation is as 
follows. For a given potential A,, D'D and DDt have, say, n+ and n_ zero 
modes, respectively. What we have shown is that ny —n_ = —v[A], so that, 
except for |v{A]| zero modes, the remainder are paired up. The paired zero 
modes, if any, are generally not preserved under small changes in A,. Thus 
they disappear, become nonzero modes, when we make a small change in the 
potential. The unpaired zero modes are preserved under small deformations 
of the potential A,. 
Consider now the functional integral over fermions. We have 


eG / du(A)[drbdrp] e*” exp (-sia = / iy Dv) 
7 > Z, (16.63) 


For the sector with v = —1, we can write 
Z\ = / dy(A)[dbdy] e~? exp (-sia / by De) (16.64) 
The Dirac Lagrangian can be split as dy - Dy = 6, Dé¢r — ¢rD' or where 
bx € Ex, or € Eb 
or € Ep, or € Ej (16.65) 


Based on this, we introduce the mode expansions 
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dr = ado + > bndn 


or =ad5 + > endh, (16.66) 


where ¢o is the zero mode and the coefficients a, @, bn, bn, CnEn are Grassman 
variables. The action does not involve a, @ because ¢o is a zero mode. In fact, 
for the Dirac action, we get 


Sp = we Padbm [6D bm = tncm | 6D" xm| (16.67) 


The fermionic measure of integration contains the factor dada. Thus 


A= faut) [dbdbdedz|dada e~*’ exp (—Sp(b,c, b, 2) — S(A)) 
= (16.68) 


since f dada = 0 for Grassman variables a and @. The one-instanton (and 
more generally the many-instanton) contribution to the functional integral 
is zero if there are massless fermions. The nonzero amplitude must involve 
éroér, which can produce factors of a and @. We find 


bro = a0 f 6360 ++ 


(oro) = [ eaaa aa (/ d26560) x 
/ du(A)[dbdbdede] e~*” exp |—Sp(b, ¢, b, 2) — S(A)] 


(16.69) 


More generally, there are —y zero modes and we have to have a factor 
like (¢r¢,)~” so as to get a sufficient number of Grassman variables for 
a nonzero amplitude. If there are several species of fermions, we have zero 
modes corresponding to each species and so we must have a factor like dr@y 
for each species to have a nonzero value for the integral. By the antisymme- 
try properties of the Grassman variables, we see that the nozero amplitude 
involves det(¢z¢z). The result of the integration over fermions can then be 
represented as an effective interaction 


Sefp = —i v | 08 det drdz (16.70) 


16.5 Baryon number violation in the standard model 373 


This is the ’t Hooft effective interaction. The effective action which we ob- 
tained for the axial anomaly is closely related to this. It provides one way 
to understand the U(1)4 problem in QCD; the extra mass of the 7’ parti- 
cle is essentially due to this interaction induced by the instantons of QCD. 
A similar term generated by the instantons of SU(2) x U(1) gauge theory 
of electroweak interactions leads to baryon-number-violating effects in the 
standard model. . 

Under the axial U(1) transformation ¢, — e~'“d,, dr — e’“dp. In the 
fermionic measure of integration, the exponentials from the transformation 
of the b’s and c’s cancel out. The measure thus transforms as 


[dada] [dbdbdcde| > e?** [dada] [dbdbdcde| (16.71) 
In general, for instanton number v, we get 


[dvdd] + e~ 2” [dar] 
= exp (2ia(Trys)) [dydy] (16.72) 


where we have used the definition of the functional trace over 5 in (16.58). 
The exponential factor can be interpreted as the Jacobian of the axial U(1) 
transformation ~ — exp(—iysa)w. (The Jacobian for the transformation of 
the fermionic measure is the inverse of the Jacobian for bosonic fields, which 
explains the sign in the exponent.) Thus, although the action is invariant 
under axial U(1) transformations, the measure of integration is not and this 
leads to the axial anomaly. The integrated version of the axial anomaly is 
the index theorem (16.60). This gives a new understanding of anomalies as 
arising from the lack of invariance of the functional measure; this will be 
discussed in more detail in the next chapter. 
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We have seen in our discussion of anomalies that baryon number (B) and 
lepton number (L) are not good symmetries in the standard model, being 
violated by anomalies. Only B — L is a good anomaly-free symmetry. Thus 
the standard model does contain baryon-number-violating interactions. We 
can use the index theorem to show how this arises in some detail. 

The violation of baryon and lepton numbers was obtained in Chapter 13 


as 
0, JF = O,J¥# = cg[b] — 2 cole] (16.73) 


The expression —c2[b] is the instanton density for the SU(2) gauge field of 
weak interactions. Consider, therefore, a gauge-field configuration for which 
this SU(2) instanton number is not zero, say, y = —1, as an example. In this 
case, there are zero modes for all the massless fermion fields which couple to 
the SU(2) gauge field. In the standard model with one generation of quarks 
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and leptons, the relevant fermions are the left-handed quark doublet of u and 
d quarks and the e, vy. doublet. The quarks come in three colors, which is just 
a degeneracy factor for the coupling to SU(2) x U(1) gauge fields. Thus we 
expect three quark zero modes, one for each color, and one lepton zero mode. 
Each field can be expanded as 


PL = ago - SS bndn apse 


on ene (16.74) 


Since the Grassman coefficients anticommute with each other, we need an- 
tisymmetrization of the fields to get a nonzero value. Keeping this in mind, 
the nonvanishing amplitude in the one-instanton sector is of the form 


Cabeeapense Ore 2B (2) QE (x)l>,(2)) 
= f aulA,v,8 cP epeugense” OH) "8 (2) QC? (2)l9,(z) 
(16.75) 


Here a,b,c refer to color indices; a,...,6 are SU(2) indices; and A,...,D are 
Lorentz indices for the two-component left-handed fields. Notice that the 
quantity on the left-hand side is invariant under color SU(3), weak SU(2), 
and the hypercharge U(1)y as well as under Lorentz transformations. This 
combination will give the right number of Grassman coefficients for the zero 
modes to make the integral nonzero. The actual value of the integral is small 
since the action for an instanton is at least 817/g?, where g is the SU(2) 
coupling constant. Thus we get a suppression factor exp(—87?/g") from the 
exponential, if we expand around the one-instanton configuration. Since g is 
small, this factor is very small. If we consider an amplitude similar to (16.75), 
but with a number of Higgs fields also at the same point 2, effectively it 
corresponds to a local baryon number violating interaction with a number of 
accompanying Higgs particles. The total phase space integral for the cross 
section can give an enhancement which can compensate for the exponential 
suppression to some extent. Nevertheless, the rate remains small in general. 
The question of whether it may be sufficient to explain the baryon asymmetry 
of the universe is interesting; it seems it is inadequate unless there is further 
enhancement due to the characteristics of the electroweak symmetry breaking 
phase transition. 
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17.1 Anomalies and the functional integral 


Anomalies arise when a classical symmetry cannot be realized in the quan- 
tum theory because there is no choice of regularization which preserves all 
the required or desired symmetries. We have calculated the anomalies in 
perturbation theory using Feynman diagrams. Fujikawa has given an ele- 
gant interpretation of anomalies in the functional integral language. As we 
have seen, a quantum field theory can be defined by the functional integral 
of the exponential of the Euclidean action. All questions of regularization 
and renormalization are equivalent to the problem of a proper regularized 
definition of the functional integral. There are two basic ingredients in the 
functional integral, the classical action and the functional measure of inte- 
gration. In a situation with an anomalous symmetry, the classical action has 
the symmetry. Fujikawa’s observation was that the quantum anomalies can 
be understood as arising from the nontrivial transformation of the measure 
under the symmetry transformation. 

Consider the functional integral for a set of massless Dirac fermions vec- 
torially coupled to a nonabelian gauge field V,,. 


a= if [dba] eS) 


S(y,0) = / be by: (O+V0 (17.1) 
The classical action has the chiral U(1) symmetry 
oe, ps pe (17.2) 


where the parameter 6 is independent of 7“. We now consider a change of 
variables in the functional integral given by o)’ = e7*%59() yp, yy! = p e~*89), 
where 6(x) is taken to be an arbitrary function of the coordinates. Z is 
unchanged under this since it is just a change of variables. Thus 


hm / [dep dep’) e~SH'-¥) 
= [lava det(e?"75°) e S(%) exp |- [ae a(2)9.J| 
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= flavaa) eS) exp pete(a50) — pas 00,45 (17.3) 


where J . is the U(1) axial vector current 


Jus = yrs (17.4) 
Se re 

(Since we have Grassmann variables, the Jacobian is the determinant of the 
inverse of the transformation.) The invariance of the classical action gives 
the classical conservation law 0,J,5 = 0. Notice that in (17.3) one gets an 
additional contribution, det(e?*7°), which is the Jacobian of the transfor- 
mation of the measure. Comparing the result (17.3) with (17.1), we get the 
Ward-Takahashi (WT) identity 


/ [dypayp] eS) i (one 2iTe(56) =0 (17.5) 


Tr(7s9) involves the functional trace and a trace over the Dirac matrices and 
must be calculated with proper regularization. This has been done in the 
previous chapter and gives the result 


1 = < 
Tr(158) = 5 / de 6 Te(Fy Fy) (17.6) 
The WT identity now becomes 
aA a + i z 
/ [debdy] e~ SM.) / d‘z 0 lauds — soe THF) =O. (17 


The WT identity for correlation functions can be obtained by the same pro- 
cedure with sources inserted in the functional integral. The lack of conserva- 
tion of the U(1) axial current shows that the symmetry is not obtained at 
the quantum level. The variation of the quantum effective action can be read 
off from the above equation as 


4 = 
Oo = ‘i d‘x 0 a Ts Bay Fp) (17.8) 


This is in agreement with our calculation using Feynman diagrams in Chapter 
dae 

We have shown the calculation for the U(1) transformtion. This approach 
to the anomalies can be used for nonabelian symmetries as well. In this case, 
one finds Tr(75@) again, where 6 = —it?6%(x) is a parameter of the nonabelian 
transformation and the trace involves a functional trace, Dirac trace and a 
trace over the products of Lie algebra matrices. One can also include vector 
and axial vector gauge fields. The result of the calculation is the Bardeen 
form of the anomaly which was given in equation (13.33). 
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12 Fay Fag 


1 
Tr(759) = -za | te ehvoB Sty 


1 
of fa Fan aie 
As 
—3(ApAL Frag “8 ApFyvaAg tr Fy wAaAsg) 


8 
+5 AnAedads (17.9) 


(The axial anomaly will be 2i times Tr(7s56); Fy, and Fayv are given in 
Chapter 13, equation (13.2). ) 


17.2 Anomalies and the index theorem 


There is a simple and very elegant relationship between anomalies and index 
theorems for differential operators which occur in the kinetic terms of particle 
Lagrangians. We have already seen an example of this in the last chapter 
where the index of the Dirac operator in four dimensions was given by Tr(7s), 
which is the integral of the anomaly for the axial U(1) transformation. This 
result generalizes to other dimensions. 

Let M be a 2n-dimensional manifold with Euclidean signature. One can 
define a set of Dirac -y-matrices by the algebra 


Yu Ye if Aan = 2 Owl (17.10) 


One can realize them explicitly as (2” x 2”)-matrices. The chirality matrix 
or the analogue of 5 matrix is then given by 


Yanti = tM1Y2°°* Yan (17.11) 


(Here we have slightly changed our definition of 72,41 compared to our def- 
inition of ys in four dimensions. In four dimensions, it was convenient to 
have €°1?3 = 1; in generalizing, it is easier to take €12---(2n) = 1.) Fora 
Dirac spinor in 2n dimensions with 2” components, the chiral projections are 


defined by : 
p+ = 5 + Yant1) (17.12) 


The index of the Dirac operator + - D gives the number of normalizable zero 
modes of positive chirality (n+) minus the number of normalizable zero modes 
of negative chirality (n_). This is given by Tr(72n+41). One can show by direct 
calculation 


ny —- n= i A(R) \ Ch(F) (17.13) 
M 


where A(R) is the characteristic class A-genus and Ch(F) is the Chern char- 
acter defined in Chapter 14. This result is a special case of the very general 
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index theorem due to Atiyah and Singer. The axial U(1) transformations are 
given by 
y — exp(—ty2n419) (17.14) 


and the anomaly of the theory under these transformations is given by 
2 Thao / 9 A(R) AOMP) (17.15) 
M 


The index density gives the anomaly. 

The anomaly for the nonabelian transformations is also related to the 
index density. As we shall show below, for a fermion theory in 2n dimensions, 
it is the index density in 2n + 2 dimensions, which we will denote by Z2n+2, 
that is relevant. Thus for four dimensions we must start with the index density 
corresponding to six dimensions. It is given by 


5 eS ere 


2 
ra seagutF THR? (17.16) 


(Wedge products are understood, we have omitted them to avoid cluttering 
the notation.) Concentrating on the TrF* term first, notice that we have 
dTrF? = Tr(DF F?+F DF F + F? DF) =0 by Bianchi identity. Thus, we 
expect that, at least locally, we can write 


7) 
A873 


TrF? = dws (17.17) 


ws defined by this equation is the Chern-Simons five-form. There is a neat way 
to calculate this and other Chern-Simons forms which we will now describe. 
Let A, = sA, where s is areal parameter between zero and one. The curvature 
or field strength corresponding to this is F, = sF'+(s?—s)A?; alsodF,/ds = 
dA + 2sA? = D,A, where D, is the covariant derivative with respect to A,; 
i.e., Ds>C = dC + sAC + CsA for a one-form C. The Bianchi identity is 
D; \ F; = 0. Evidently we can then write 


grti 
(Qn)"*1(n + 1)! 


gnti 


al 
d 
Tero -| ee 
| ©! a AT 


nm+1 
Tee 


1 n+l 
2 
— I ————— eee 7 
/ s QnyrHin +1)! (n+1)TrD, AF; 


1 ; grt 
2 i: Teceresyicae 
= dwon+1 (17.18) 


where 
n+1 


1 ‘ 
a Le ; 
W2n4-1 =| 4S oa oe (17.19) 
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Using F, = sF + (s? — s)A?, we can evaluate the integral to obtain the 
Chern-Simons form w2,41. Explicitly, for n = 2, we find 


i 3 -_ 
ws5(A) = = (aaaaa + pAdA + 24°) 
= wer AF? — 1 3p fe as 17.20 
4873 y} 10 (p20) 


ws is not gauge-invariant; however, since its derivative is proportional to TrF 
and this is gauge-invariant, we have d[ws(A9) — ws(A)] = 0, where AY = 


gAg~' —dgg™' is the gauge transform of A. This shows that w5(A9) — ws(A) 
is a closed five-form. It can be explicitly calculated as 
A®) — we( AVS doy + —*—Tr(ag p=) 
w5(A%) —ws(A) = dag + 43023 Tr(dg 9") 


— —1 1 1 143 
4 =—-i3 Ta dg (Ada + 5dAA + 54°) 


— 7 ive 
+9 ‘dg Ag ‘dg A) A ‘ag)*A| 
(17.21) 


a4 is a four-form. For infinitesimal transformations, g ~ 1-6, A— A+ D6, 
we can write 


D 
4873 


a4 & Ty as (Gada 5dAA + 54°)| (17222) 
Comparing with (13.31), we see that we can write the anomaly for a fermion 
of left chirality as 27i f az. 

We can regard ws5(A9) — ws(A) as giving the integrated version of the 
anomaly or the change in the effective action under a finite gauge transfor- 
mation g. The WZ consistency conditions are the integrability conditions for 
composing infinitesimal transformations to obtain the change of the effective 
action under a finite transformation. Since w5(A9) —ws5(A) already furnishes 
an integrated version, it is clear that expression (17.22) will obey the WZ 
consistency conditions. 

The result for finite gauge transformations may be represented neatly as 
follows. On the four-dimensional space, we are interested in fields obeying 
the condition F — 0 and g — 1 as {z| — oo. Thus M is effectively compact 
with no boundary (~ S*) and we can consider a five-dimensional space D (a 
unit disc in R>) with boundary 0D = M = S%*. Let U denote an extension 
of the function g from spacetime to D, so that U restricted to 0D = M is 
g. The change in the effective action under a finite gauge transformation can 
be expressed as 


Aron ip lws(AZ) — w5(A)] (17.23) 
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Equivalently, we may write for the Dirac determinant of a fermion of left 
chirality 


det(y - D9) = det(y- D) exp (2ni f [ws5(AY) — ws(A)]) (17.24) 
D 
In [ws(AY) — ws(A)], we encounter the term ~ »>* «- 


(5)_ __? —1)5 

Lp ee 7003 Tr(dU U~*) (17.25) 
As shown in Chapter 14, this five-form is closed but not exact; it is the 
pullback to D of an element of H°(G). Therefore (17.23) cannot be integrated 
to get something defined on the four-dimensional spacetime. This shows that 
the anomaly cannot be eliminated by a local counterterm, thus completing 
the identification of the four-dimensional anomaly in terms of the the six- 
dimensional index density. The expression 27i [,[ws(AY) — ws5(A)] was first 
obtained, in a different form, by Wess and Zumino by integrating the anomaly 
and is therefore referred to as a Wess-Zumino term. The form in which we 
have given it is due to Witten. Since the differential forms do not involve 
metrical factors, equations (17.23- 17.25) only need M to be topologically 
S*. Thus the formulae apply for MM = R?4 with the field strengths vanishing 
as |x| — oo. 

The specification of the finite anomaly (17.23) requires the extension U 
of g from the four dimensions to the five-dimensional space D. (The gauge 
fields only occur in a4 and hence need no extension.) There are many ways 
to extend g to the disc D. Consider two different extensions U,, U2 We can 
think of spacetime as the equator of a five-sphere and the two extensions as 
U’s defined on the upper and lower hemispheres. In other words, we take D 
to be the upper hemisphere for the first extension and the lower hemisphere 
for the other extension. We then see that the difference in the finite anomaly 
is given by 


AI(U;) — AP(U2) = $ aU) (17.26) 
5 


where U = U for the upper hemisphere and U = U2 for the lower hemisphere. 
On the equator U; = U2 = g, so there is no difficulty of continuity of the 
functions on S°. The integral in (17.26) gives the winding number of the map 
U : S° — G considered as an element of IJ5(G). This is an integer and so 
the ambiguity of different extensions will not affect equation (17.24) or the 
exponential of the effective action. 

The reason we cannot eliminate the anomaly by a counterterm has to do 
with the term 2), which is an element of H°(G) for the gauge group G; it 
cannot be written as an exact form and so it does not become the integral 
of a local function on spacetime. H°(G) is zero for all compact Lie groups, 
except for SU(n), n > 3. In four dimensions anomalies can possibly occur 
only for these gauge groups. 
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17.3 The mixed anomaly in the standard model 


The relationship between the higher-dimensional index density and the 
anomaly for nonabelian transformations holds in general. We can in fact 
generalize the formula (17.24) as 


det(y - D9) = det(y- D) exp (2nilvanss(A”) a wania(A)]] (Ee27) 


In general, there can also be gravitational anomalies corresponding to the 
quantum breaking of the symmetry of coordinate transformations or diffeo- 
morphisms. This can also be obtained from the index density in higher di- 
mensions. Since the index density Zo,42 corresponding to 2n + 2 dimensions 
is a closed 2n + 2 form, we define 


Lont2 = dwen+i (17.28) 


where the Chern-Simons form w2,41 can depend on the gauge and gravita- 
tional fields. Let ©2,+41 denote the Chern-Simons form with the transformed 
gauge and gravitational fields. For the left chiral Dirac determinant on a 
2n-dimensional space M2", we then have 


det (y -D) = det(y- D) exp (2ni [leans — vant) (17.29) 


where OD = M?". The index density Z2n+42 is referred to as the anomaly 

polynomial for 2n dimensions. It is very useful in checking for the absence of 

anomalies; if this polynomial vanishes when the traces are evaluated over the 

assigned representations of the fields, then the anomaly is zero. This is much 

easier in practice than using the explicit formula for the anomalies. This is 

the procedure used for verifying anomaly cancellation in string theories. 
The full index density in four dimensions is given by 


1 2 
812 ailllen 19272 
This shows that chiral fermions in two dimensions can have gravitational 
anomalies as well as anomalies under nonabelian gauge transformations. 
The index density corresponding to six dimensions given in (17.16) shows 
that, in a four-dimensional theory, we can have gauge anomalies and anoma- 
lies which mix gauge and gravitational fields. This means that the expression 
for the change in the effective action under a coordinate transformation is 
not zero unless the gauge field is zero.) There are no purely gravitational 
anomalies in four dimensions. 
We have already checked the absence of gauge anomalies for the standard 
model. However, since the U(1)y charge is nonzero for each fermion, the 
mixed anomaly can potentially occur for the standard model, due to the 


Ig = Ay fad (17.30) 
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TrF term in the formula (17.16). Actually, with the assigned charges of the 
fermions, TrY = 0 if taken over the quarks in each generation or over the 
leptons in each generation. Thus the mixed anomaly is also zero for the 
standard model; one can consistently couple it to gravity. 


17.4 Effective action for flavor anomalies of QCD 


The theory of strong interactions, namely, QCD, involves quarks and glu- 
ons. These are eventually bound into states which are color singlets which 
are in turn mesons, baryons, and glueballs. They are observable as asymp- 
totic states. This picture of confinement of colored states leads to certain 
requirements on the effective theory of mesons and baryons. For example, 
some of the electroweak anomalies cancel between quarks and leptons. In the 
low-energy regime of confined quarks, the leptonic anomaly is unchanged. 
There must therefore be some terms in the effective action for mesons and 
baryons which represent the flavor anomalies of QCD and which ensure the 
cancellation of the leptonic anomaly. Otherwise, the picture of confinement 
cannot be consistently implemented. One could turn this around and argue 
that for any theory, not necessarily QCD, there must be matching of anoma- 
lies between different phases of the theory. This anomaly-matching condition 
was first proposed by ’t Hooft as a way of understanding phases of a gauge 
theory. There is one potential caveat to this argument as we have outlined it. 
The anomaly arose as a high-energy effect, having to do with regularization 
of short-distance singularities. So one might argue that there is no reason for 
it to be the same in the low-energy effective theory. The topological nature 
of the anomalies shows that they are invariant under change of scales and 
so should be the same for the low-energy theory as well. In other words, if 
anomalies cancel between two sectors of the theory at high energy, they must 
also cancel at low energies, even if one sector is in a different phase. There 
is another elegant argument leading to the same conclusion. In the trian- 
gle diagram which leads to the anomaly, one can check that the anomaly is 
due to the imaginary part of the diagram. The imaginary part of a one-loop 
diagram, by the unitarity result (12.111), is related to the tree-level cross 
section, which can be calculated in terms of asymptotic states in the effective 
theory. Therefore, the anomaly can be viewed as a high-energy effect or as a 
low-energy effect. For this reason, we must demand a matching of anomalies 
between the high- and low-energy regimes of a theory. 

Coming back to the specific case of QCD, we may thus ask how the flavor 
anomalies are represented in terms of mesons and baryons. The effective 
theory arises from the chiral symmetry breaking Uz(3) x Ur(3) — Uy(3) 
and involves the meson field U, as discussed in Chapter 12. The Goldstone 
bosons behave in a way similar to the parameters of gauge transformations. 
(This is why they can be absorbed into the gauge field in a unitary gauge 
for the Higgs mechanism.) Therefore, we can expect the effective action for 
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QCD to have a term of the form ws(AY) — ws(A), where U is the Goldstone 
field for chiral symmetry breaking and A’s are flavor gauge fields. This is 
almost but not quite right. The reason is that this expression would give the 
anomaly for a set of left-handed fermions and so would be appropriate for a 
symmetry breaking pattern Uz(3) — 1. The chiral transformation rule for U 
is U += U'=9, U on From (17.21), we see that the relevant term involving 
U should thus be of the form 


N ; 
Iwz — -igga | (Tr(dU U~*)°+2nN[a4(U~!, Ar) —aa(U, Ap) ++ Deane 
(17a) 


where we have a factor N corresponding to the number of colors which are just 
the degeneracy for quarks for the computation of flavor anomalies. For QCD, 
N = 3, but we display the more general result. [.oynz are extra counterterms 
which can be added, if necessary, to ensure that the flavor anomalies for the 
nongauge directions are expressed in a gauge-invariant way, analogous to the 
Bardeen form of the anomaly. This term can be worked out by considering 
transformations of yz, and the full result is 


= 1\5 
Iwz=- ==: re ) 


“aa |, Tr [((ArdAr + dAzrAz A 7 )aUUa 


+4 ise a Tr[(AndAr + dApAr + A3)U— dU] 


86 ver J, Tr[A,dUU~*A,dUU~* — AgU~*dU AgU~*dU] 


~1)8 — Ap(U~'dU)?] 


“ie 


“B a mf my Tr[dA,dU ApU~! — dAgd(U-)ALU] (17.32) 
T 


B isa J, Tr[ApU~!A,U(U—!dU)? — ALUARU + (dUU *)?| 
vis 


oS ah Tr{(dApArg + ArdArg)U~'ALU 


—(dA, Az + A,dA,)UARU "| 


ae Ea |, Tr[A,UAgU~'A,dUU~' + AgU~* A,UARU*dU] 


3 al 3 = 
- 7a |, Bl Tr[A3U 1 ALU — ASU ARU™! 


+5UARU- 14,UARU- TAT] 


This effective action, first given by Witten, will describe all processes which 
are mediated by flavor anomalies. It is odd under U @ U~?, Ay ~ Ap as it 
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should be. The combination U — U~!, 2 — —a is a symmetry, except for 
the Ay, Ar gauging. This corresponds to parity, which is preserved by the 
strong interactions. The gauge fields of weak interactions break parity sym- 
metry, so there is no reason to expect this symmetry with Ar, Ar. The term 
proportional to Tr(d/U~')® can describe certain purely mesonic processes 
due to the anomaly; an example is K+ K~ > rta~7°. 

The gauge fields in (17.32) are to be evenually restricted to the electroweak 
gauge fields. If we consider only the electromagnetic field, the terms which 
depend on the gauge field are 


ee il Tr[A(dUU71)3 + A(U~*a0)" 


<= 
16x? 


Hee sg fT Tr[dAA(dUU~* +U- 140) 


= / Tr[dAdU AU~! + dAU AdU~} (17.33) 
3272 M 


By expanding U, it is easily checked that this leads to the effective term for 
7° —+ 2y decay, which was separately analyzed in Chapter 13. 


17.5 The global or nonperturbative anomaly 


The anomalies in symmetry transformations discussed so far can be expressed 
in terms of the change in the effective action or the fermion determinant un- 
der an infinitesimal transformation. This change is computable in terms of 
Feynman diagrams or the Jacobian of the fermion measure for infinitesimal 
transformations. The change for finite transformations is obtained by inte- 
grating the anomaly. The consistency conditions for integrability are the WZ 
conditions given in Chapter 13 and the integrated form is obtained as in 
equation(17.23) or (17.29). But there can be transformations which cannot 
be continuously connected to the identity and so cannot be obtained by inte- 
grating infinitesimal transformations. It is possible for the fermion measure 
or the fermion determinant to change in a nontrivial way under such a trans- 
formation. This was shown by Witten in the classic case of an SU(2) gauge 
theory in four dimensions. This is generally referred to as a global anomaly 
or nonperturbative anomaly. 

For an SU(2) gauge field in four dimensions, there is no perturbative 
anomaly, since d%°° ~ Tr(t2¢°e + ¢24°t®) = 0. Thus one might expect an 
SU(2) gauge theory with one doublet of chiral fermions to be a consistent 
theory. Witten showed that such a theory would be inconsistent due to the 
global anomaly. 

For the gauge theory under consideration, the gauge transformation is 
an SU(2) matrix g(x) : R* — SU(2). As usual, we will consider trans- 
formations with g — 1 as |x| — oo. Functions g(x) with this condition are 
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equivalent to functions from S* to SU(2). The possible homotopy classes of 
these are given by I4[SU(2)], which is Z2. This means that there are two 
types of gauge transformations for an SU(2) gauge theory in four dimen- 
sions. The first type of transformation corresponds to the trivial element of 
IT4[SU (2)] = Ze and hence such transformations are all smoothly deformable 
to the identity everywhere on spacetime. They are thus connected to the iden- 
tity and any anomaly they can lead to can be computed from infinitesimal 
transformations. (For SU(2), there is no such anomaly.) The second type of 
transformation belongs to the nontrivial element of Zz; and hence, by def- 
inition, these transformations are not smoothly deformable to the identity 
everywhere on spacetime. Let g(x) be a transformation which belongs to 
the homotopically nontrivial class of transformations. Because the homotopy 
group is Zz, two successive transformations by g(x), which is equivalent to 
a transformation by g?(zx), will correspond to the trivial element of Zz and 
can be deformed to the identity. 

The change in the fermion measure under g cannot be computed from 
infinitesimal transformations. The gauge potentials A and A9 correspond 
to the same physical configuration. The configuration space of potentials is 
connected and one can therefore construct a path A(z,7) from A(xz,0) = A(z) 
to A(z,1) = A9(z). By using an index theorem, Witten then showed that 
the fermion measure changes sign as we go from A to A9, 


det(y - D9) = — det(y- D) (17.34) 


This renders the theory inconsistent since one cannot restrict the integration 
over the gauge fields to regions where one sign is obtained for the determiant. 

We will now show this result by a different argument which relates the 
global anomaly to the usual perturbative anomaly. We think of the gauge 
group SU(2) as being embedded in SU(3). Since II,[SU(3)] = 0, there is no 
transformation which is not deformable to the identity and hence there is 
no global anomaly in this enlarged theory. However, while SU(2) is pertur- 
batively anomaly-free, SU(3) is not. Thus the global anomaly of the SU(2) 
theory can be recovered from the usual result (17.24). When we go to SU(3), 
we must consider a triplet of fermions, since the original fermions form a 
doublet under SU (2). The extra added fermion field is a singlet under SU(2) 
and so it will not affect arguments regarding the consistency of the theory 
for SU(2) transformations. Under a general SU(3) transformation A — A%, 
we have 


det(y- D9) = det(y- D) exp [i '(U, A)] 
r(U,A) =2n [ [w5(A") — ws(A)] 


= Qn | ip aa(g, A) + [ 2(u)| (17.35) 
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where U has the property that U = g on OD = M. The gauge fields occur 
only in a4 and we can keep them as SU(2) gauge fields. a4 is then zero, 
since the perturbative anomaly is zero for SU(2); and so, for the rest of our 
argument, we can take 


r(u,A) =U) =2n 2 AY), (17.36) 
D 


If we consider two extensions of g into the disc D, say, U and U’ with 
the same boundary value g, then the diference [(U) — ['(U’) is 27 times 
an integer by the arguments given after (17.26). This integer is the winding 
number of the map S° — SU(3), which labels the elements of J75{SU(3)] = Z. 
Alternatively, we can write U'’ = UV, where V : D — SU(3) with the 
condition V = 1 on OD = M. Such a function is equivalent to a map from 
S® to SU(3) and so we get the winding number for S° — SU(3) again. Let 
V, denote the basic map generating all the nontrivial maps D — SU(3), with 
V, =10n OD= MM. V, has winding number 1 and so 


T'(VYy) = 20 (17.37) 


Once we have shown the consistency of different extensions of g into D, 
it is sufficient to consider only one extension U in the formula (17.35). U : 
D — SU(3) gives a map of the disc D into SU(3). Instead of pulling back 
the five-form to D via this map and integrating, we can think of (17.36) as 
the integral of 2°) over the image disc in SU(3). 

We now argue that this can be regarded as an integral over a sphere in 
SU(3)/SU(2). We can easily check that [(Uh) = I'(U) for any h € SU(2) C 
SU(3), since the perturbative anomaly is zero for SU(2). Thus I’ is defined 
on maps from D to SU(3)/SU(2). Since the boundary value of the functions 
U is in SU(2), the image of the disc D, while it is a disc in SU(3), is a sphere 
S° in SU(3)/SU(2). Further, the value of the integral '(U) is invariant under 
small deformations of U, since the integrand is a closed two-form. Thus it is a 
topological invariant for the maps S° — SU(3)/SU(2). It gives a real number 
for every element of Is5[SU(3)/SU(2)] = Z. (The space SU(3)/SU(2) is the 
five-sphere S°.) We also see from expanding out the five-form 


ru’) =r(v) + ru’) ~ (17.38) 


for maps U, U’ with boundary values in SU(2). 

Consider now the nontrivial SU(2) transformation g(x). There is no way 
to extend this over the entire disc D staying within SU(2), but we can find 
U : D — SU(3) such that U = g(x) on OD = M. For the transformation of 
the fermion determinant we need (VU). This number can be evaluated using 
the exact homotopy sequence discussed in Chapter 14; the part of sequence 
which is relevant for this case reads 
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0— S41, [sv@)| — 0; [se | + I4{SU(2)] + 0 


Z- Z- Z2— 0 (17.39) 
2k+1— 1- 0 
n—- 2k — 0- 0 


The exactness of the sequence tells us that what maps to zero must come as 
the image of the previous map. Therfore the nontrivial element 1 of Z2 must 
be the image of some elements of IJ5{[SU(3)/SU(2)]. None of those elements 
can come as the image of the previous step, so all of I5[SU(3)] must map 
to some other elements of I75[SU(3)/SU(2)]. Given the composition rules 
for the homotopy elements, we then see that all even winding numbers in 
ITs[SU(3)/SU(2)] are images of [73[SU(3)]. The odd winding numbers of 
II5[SU(3)/SU(2)] are not images of elements in J75[SU(3)] and map onto 
the nontrivial element of 14[SU(2)}. 

There are many U’s giving the same boundary value g, corresponding 
to different elements of I5[SU(3)/SU(2)]. Let U, denote the extension of g 
which is the basic nontrivial map of winding number 1 in I5[SU(3)/SU(2)]. 
The function U? corresponds to the trivial element of I74[SU(2)] and can 
be deformed to the identity on the boundary OD. Since the boundary is the 
identity and not any SU(2) element, the corresponding image disc is a sphere 
in SU(3) and hence, using (17.37) and (17.38), we have the result 


rb) = 5 [rG) + re) 


it x 
5 


=5 rv) 


anes (17.40) 


This gives the result (17.34). 

In the standard model, there are an even number of doublets of fermions 
for the SU(2) group of weak interactions. Thus the standard model is free of 
the global anomaly. In seeking extensions of the standard model, by changing 
gauge groups and particle contents, one must ensure the absence of global 
anomalies as well as the perturbative anomalies for gauge transformations. 
This can impose some additional constraints on the model. 

It is easy to see that these arguments for the global anomaly can be 
generalized to other dimensions as well as other representations of fermions. 
For a gauge group G in n dimensions, there are gauge transformations which 
are not connected to the identity if I7,(G) is nonzero. From the homotopy 
groups of Lie groups, some of which were listed in Chapter 14, this can be 
seen to occur in many dimensions. The investigation of whether there is a 
global anomaly for the given fermion representations can be carried out in a 
way similar to what we have done here. 
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17.6 The Wess-Zumino-Witten (WZW) action 


The Wess-Zumino-Witten (WZW) action is intimately related to anoma- 
lies in two dimensions and has many applications. It is used for nonabelian 
bosonization. It defines a conformal field theory in two dimensions; various 
rational conformal field theories can be obtained as either a WZW model or 
gauged versions of it. 

The field variables of the WZW action are invertible matrices M(z), ie., 
elements of GL(N,C) or suitable subgroups and cosets of it, so it may be 
regarded as a particular type of sigma model. We will denote this target space 
by G. We shall discuss the action in two-dimensional space with Euclidean 
signature; the Minkowski version is briefly discussed in the Chapter 20 in the 
context of geometric quantization. The action is given by 


i 
Swzw = = iis dx./G 9 Tr(daMa,M~*) + IM] (17.41) 


I'[M| = =| dr e#¥@Tr(M—'0,MM~'d,MM~'0,.M) (17.42) 
127 M3 


Here M? denotes the two-dimensional space on which the fields and action 
are defined. It can in general be a curved manifold with a metric tensor 
Gab. (g2? is the inverse metric and g denotes the determinant of gap as a 
matrix.) M? will be taken as a closed manifold. One can also use this for 
fields on R?, by choosing the boundary condition M — 1 (or some fixed 
value independent of directions) as |x| — oo; topologically, such fields are 
equivalent to fields on a closed manifold. ['[M] is the Wess-Zumino term. It 
is defined by integration over a three-dimensional space M* which has M? 
as its boundary. The integrand is a differential three-form; it does not require 
metrical factors for the integration. However, it requires an extension of the 
fields to the three-space M3. There can be many spaces M? with the same 
boundary M?, or equivalently, there can be many different ways to extend 
the fields to the three-space M3. If M and M’ are two different extensions 
of the same field into the three-space, we write M’ = MN, where N = 1 on 
M?, the boundary of M°. By direct computation, we observe that 


P(MN] = r(M] + rpj- 2 _ Pe €°Tr(M-19,M d,NN-) (17.43) 


The last term vanishes for N = 1 on M? = 0M3. 

Since N = 1 on the boundary of M3, N is equivalent to a map from a 
closed three-space to G. In general, there are homotopically distinct classes 
of such maps. For example, if we take M? = S? (or R? with the boundary 
condition indicated), M? is a ball in three dimensions. With the prescribed 
behavior for N, it is equivalent to a map from the three-sphere S* to G. 
As we have seen many times before, the homotopy classes of such maps are 
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given by II3(G). If G contains any compact nonabelian Lie group, then this 
is nonzero. In particular, 1J3(G) = Z for all simple nonabelian Lie groups, 
except for SO(4), in which case it is Z x Z. The winding number of the map 
N(z) : S? — G is given by 


1 
2412 


Q[N] = $ ide oTy(N-10,NN—0,NN—0,N) (17.44) 
Ss 

Q[N] is an integer for any N(x). Thus, for [[N], we have two cases to discuss. 
I(N] is zero for N close to identity, to linear order in ONN~1; hence, by 
successive transformations, ’|M] is independent of the extension to M? for 
all N connected to identity, i.e., for N belonging to a homotopically trivial 
element. On the other hand, if N is homotopically nontrivial, the integral 
I[N] gives 277 times the winding number of the map N(z) : S? — G. Since 
Q|N] is an integer, exp(—k I"[M/]) is independent of how the extension into 
the three-space is made if k is an integer. Thus, by using the action 


Sk Swaw (17.45) 


where k is an integer, we can construct a field theory on the two-space M?. 
Since the theory can be defined by using exp(—S) = exp(—kSw zw) to con- 
struct the functional integral, this will be well-defined, not requiring more 
than the specification of field configurations on M? itself. The action (17.45) 
defines the WZW theory; k is referred to as the level number of this theory. 
Even though we presented the arguments for quantization of the coefficient of 
the action for M? = $?, similar arguments and results hold more generally. 

For the simplest case of M? = R? with the appropriate boundary con- 
ditions on the field M(x), we can write the WZW action using complex 
coordinates as 


——_ = I. Tr(0,Md;M~1) + P'[M] (17.46) 


For the first term in this expression, we have used complex coordinates z = 
21 —ix2, J = 21 + ix. This action obeys a very useful identity known as the 
Polyakov- Wiegmann identity. Using (17.43), one can easily verify that 


i = a 
Swaw|M h) = Swaw[M} + Swawlh] — = he Tr(M—!8,M 8,h h-?) 


(17.47) 
Notice the chiral splitting; we have only the antiholomorphic derivative of M 
and the holomorphic derivative of h. This shows that the equations of motion 
are given by 


8,(M-10;M) = M~10;(0.M M~*)M =0 (17.48) 


The WZW action has invariance under infinitesimal left translations of 
M by a holomorphic function, M — (1+ 6(z))M, and right translations of 
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M by an antiholomorphic function, M + M(1+ x(Z)). This is easily checked 
using (17.47). These transformations have the associated currents 


J, = ee 8,M M7} 
Tv 
Jz= * u-1a.M ion (17.49) 


(These currents are for a level k WZW model.) By the equations of motion 
(17.48), these currents obey 


OzJz = 0, a,J¢=0 (17.50) 


The Polyakov-Wiegmann property also gives another result for the WZW 
action which is very useful. Consider a small variation of the field M given 
by M+6M = (1+ 6)M, where 6 = 5M M7? is infinitesimal. Using the 
Polyakov-Wiegmann property, we then get 


6Swzw = -= i Tr (0:(5MM~')0,MM~*) 


= -= [ m(éMM~2,A,) = -= [ memM~D.A) 
1 = 
=-2 / Tr(ASA,) (17.51) 
where A, = —0,MM-—! and D, is the covariant derivative in the adjoint 


representation, D,A = 0,A + [A,, A]. A is defined by 
A =-—0;M M7 (17:52) 
Notice that this obeys the equation 
dzA, —0,A + [A,A,] =0 (17.53) 


17.7 The Dirac determinant in two dimensions 


The functional integrals over the fermion fields in two dimensions lead to the 
determinant of the two-dimensional Dirac operator. For massless fermions, 
this determinant can be exactly evaluated using the WZW action. 

The Dirac matrices relevant for two dimensions can be taken as o;, i = 
1, 2, since they obey the relation o;0;-+0;0; = 26;;. Consider a set of massless 
fermion fields in two dimensions which belong to an irreducible representation 
R of U(N) and which are coupled to a U(N) gauge field. (Other groups can 
be treated similarly.) The Lagrangian for these fermions can be written as 


L= (Di + iD2)p + (D1 — iD2)x (17.54) 
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where w, x may be taken as the two chiral components of a two-spinor. 
D; = 0;+ A; are the covariant derivatives. It is convenient to use the complex 
components D, = 4(D, +%D2) and Dz; = 3(Di —iDz2), since the Lagrangian 
is naturally split in terms of these. 


A parametrization for gauge fields 


The evaluation of the determinants det D, and det Dz can be done most 
efficiently using an elegant general parametrization of the complex compo- 
nents of the gauge field A, and A; given by 


A, — —0,M M-} 
A; = M*-10,mt (17.55) 


where M is a complex matrix; it has unit determinant if the gauge group is 
SU(N). (We shall consider this case first.) The possibility of this parametriza- 
tion can be seen as follows. O, is invertible in two dimensions with suitable 
boundary conditions, since there are no nonsingular antiholomorphic func- 
tions by Liouville’s theorem. The number of zero modes of its covariant ver- 
sion 0, + A, is given by an index theorem, and this is zero for an SU(N) 
gauge field. One may still have paired zero modes for specific A,’s but gener- 
ically these will become nonzero modes under a small change in A. Thus, 
generically we may take 0, + A, to be invertible and then for a given A,, we 
can define a new potential Az by 


OzA,z — 0,Az + [Az, A. | — 0 
Az = (Dz) OzAz (17.56) 


The required matrix M can then be constructed as 
M(za,0,C) = Pexp (- [ A,dz+ Axdz) (17.57) 
oc 


Equation (17.56) shows that M is independent of the path C from the origin 
to x. To see this, consider more generally the parallel transport operator 
along a path C’ from the origin to a point with coordinates z’ on a simply 
connected 2n-dimensional space M given by 


P(a,0,C) = Pexp (- i A.ds') (17.58) 
ONG 


Generally P(zx,0,C) depends on the path of integration C; if C’ is a path 
which differs from C by a small deformation by an area element o” at the 
point y along the path, we have 


ze FO | rr u ; 
6P(z, 0, C) = —Pexp (- Aidz') alee Pexp (- Aidz') 
yC 2 oC 
(17.59) 
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The path-dependence is due to the nonzero field strength F;;. In the case of 
M, we see that the flatness condition (17.56) ensures the path-independence 
of the integral. (The starting point of the integral in (17.57) may be taken as 
any penne not gts the origin.) From the construction of M, we also 
see that A, = —0,MM~— in agreement with (17.55); the second of equations 
(17.55) is obtained by hermitian conjugation. 

If the gauge group is U(1), we can use the elementary fact that A; can 
be written as a gradient plus a curl, A; = 0,6 + €;0;¢. This leads to the 
parametrization (17.55) with M = exp(¢+1 6). Thus the result (17.55) holds 
for U(N) in general. 

If the space is not simply connected, one can have zero modes for 0,; there 
are in general flat potentials a which are not gauge equivalent to zero. (The 
transformation which would transform a to zero will not be singlevalued.) 
Further, there are different types of closed paths, those which are contractible 
and those which are not deformable to a contractible closed path. For the 
latter type of path, the integral P(0,0,C) for the flat potential around a 
closed noncontractible curve C' has a value which is invariant under small 
deformations of the path; it is a topological invariant of the gauge field. (It 
is known as the holonomy of the field around C.) One can then generalize 
(17.55) to include this degree of freedom. As an example, consider the torus 
S! x $1. This can be described by two real coordinates €,, £2, 0 < & <1, 
with the identification of £; = 0 and €; = 1 and likewise for £2. The complex 
coordinate can be taken as z = €, + 7&2, where 7 = Re r+iIm7Tisa 
complex number called the modular parameter of the torus. Noncontractible 
closed curves are generated by two basic cycles called the a and ( cycles. 
Holonomies around these cycles can be generated by a constant gauge field 
a. For the remaining degrees of freedom in A, the previous argument goes 
through and so altogether we can write 


ima 
Im 7 


A,=M | M-!—0,M M7 (17.60) 


There is an ambiguity in the parametrization (17.55). Notice that M 
and MV (z), where V(Z) is purely antiholomorphic will lead to the same po- 
tential A,. On a sphere, or on the complex plane with suitable boundary 
conditions, there are no nonsingular antiholomoprhic functions (Liouville’s 
theorem), and there is no ambiguity. Any gauge-invariant function of M, Mt 
will be an observable. On other spaces where 0, has zero modes, if we use the 
parametrization analogous to (17.55), we have to ensure that only quantities 
which are independent of this ambiguity are considered as observables. Even 
on the sphere, such an ambiguity can exist when there are charges since sin- 
gularities are possible at the locations of charges. Physical quantities defined 
in terms of the potential will be free of such ambiguities. 

One of the advantages of the parametrization (17.55) or (17.60) is with 
regard to gauge transformations. Under a gauge transformation by a U(N) 
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group element g(x), M — M9 = gM. It is easy to see that this reproduces the 
transformation law A > A9 = gAg~!—dg g™}. Since M is complex, a gauge 
transformation cannot be used to set M to the identity matrix. While the 
unitary part of M can be eliminated by a gauge choice, the hermitian part of 
M is gauge-invariant. Alternatively, we may use M'M as the gauge-invariant 
quantity for two-dimensional gauge fields. 


Evaluation of the determinant 


The evaluation of the Dirac determinant will require the regularization 
of the Green’s functions for D, and Dz. The inverse of 0, is the Euclidean 
chiral Dirac propagator and is given by 


jl 
ee 
Ci mE FF) (17.61) 
For D, = 0, + A, = 0, —-0,MM~—, we then get 
=t74 
D;* (a, 2") = pa ee) (17.62) 


n(Z — Z') 


We define a regularized version by a simple procedure of separating points, 
or point-splitting, and write 


Dz" (2, 2") Reg = G(z,2') = [eu a o(a',y;€) 
ers ||5 
o(z’,y;€) = = exp (-==") (17.63) 


Here ¢ is the regularization parameter. Notice that as « — 0, a(2’, y;€) tends 
to a two-dimensional 6-function and we recover Dz1(z, x’). Thus, G(z, 2’) is 
indeed a regulated form of D;'(zx, x’) with better short-distance properties. 

Now let Ser = logdet D, = Trlog D, where the second expression in- 
volves the functional trace and the matrix trace. Taking the variation of this 
quantity, we find 6S.¢, = Tr(D716D_). Since 6D, = 6A:z, 


sae) = Tr [Dz (2, 2')(—#t*)] 


= Tr (G (2, 2)(—#")]. 49 (17.64) 
where the trace is now just over the matrices. In the second line, we have used 


the regularized version of the variation by using G for D7*. When 2’ = z, 
since o(z', y) = a(x, y) is sharply peaked at x ~ y, we can expand G(z, z) as 


Ce.2) = fev ou) ls — Md,M~‘(a) (=*) — Md;M—' 


Bs 7.05) 
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The angular integration gives zero for the first two terms. From the third 
term we get a finite contribution; higher terms vanish as € — 0. Thus we get 


iSerp = f @xTr G(x, 2)(—it)]. 9 SAZ(@) 


- * i d’x Tr [8;:MM~'6A,| 


- -+ / dx Tr(A5Az) (17.66) 


A, and A = A; = —0;MM~—' are matrices in the Lie algebra in the represen- 
tation R. For the trace, we can thus use the formula Tr(t*t?)z = ArTr(t*t*) r, 
where F denotes the fundamental representation and Ap is an integer known 
as the index of the representation R. Equation(17.66) can then be written as 


= Ar dSwzw(M) : (17.67) 


where we have used the property (17.51) for the WZW action. This shows 
that, up to a constant, Sez is given by the WZW action. When A = 0, 
det D, = det 0,, which identifies the constant, giving the final result for the 
chiral Dirac determinant as 


det D, = det(0.) exp(Ar Swzw(M)) (17.68) 


Notice that this part is not gauge-invariant. In fact, under an infinitesimal 
gauge transformation, we find 


1 
bSe¢p = —— / d’x Tr(0zAz 5g g~*) (17.69) 


There is anomaly for the gauge transformation. Since M belongs to the com- 

plexification of the gauge group, we may regard our derivation of the deter- 

minant as the integration of the anomaly 0;A/a over complex parameters. 
A similar result is obtained for the determinant of Dz, 


det Dz = det(0z) exp(Ar Sw zw (M*)) (17.70) 


For the full Dirac determinant det(D,Dz), we can take the product of the 
expressions for det D, and det Dz. This would lead to 


det(D,Dz) = det(0,0z) exp [Ar (Swzw (M) + Swzw(M"))] (Liars) 


This expression is still not gauge-invariant. A gauge-invariant expression is 
given by 
det(D,Dz) = det(0,0z) exp [ArSwzw(M'M)] 
= det (0,0z) exp [ArSwzw (H)| (17.72) 
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H = M'M is gauge-invariant. Expanding out Sw zw (#) using the Polyakov- 
Wiegmann identity, we find 


Swzw(H) = Swzw(M) + Swzw(M") 
-of Tr(M'~'0,M! 0,M M~") 


1 
= Swzw(M) + Swzw(M") + = pe Tr(AzA,) 
(17.73) 


Thus the formula (17.72) can be understood as arising from the formula 
(17.71) by the addition of a local counterterm (1/7) f Tr(AzA,). As we dis- 
cussed in Chapter 13 and earlier in this chapter, one has the freedom of adding 
local counterterms; this is equivalent to choosing a different regularization. 
Effectively, in obtaining the result (17.72) we are using a gauge-invariant 
regularization. 

The Abelian version of (17.72), including the counterterm for gauge in- 
variance, was first obtained by Schwinger in 1961. For the Abelian case, 
equation(17.72) simplifies as 


det(D,Dz) = det(0,0z) exp -z i, Fy, (x)G(a — y) Fav (y) 
ry 


G(e-y) = [ a = ewlip-(e~ 9) (17.74) 


This is the fermion determinant for two-dimensional electrodynamics, which 
is also known as the Schwinger model; we see that the effect of fermions is to 
generate a gauge-invariant nonlocal mass term for the gauge field. 
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18 Finite temperature and density 


18.1 Density matrix and ensemble averages 


In the previous chapters, we have focused mostly on processes with a small 
number of real particles in a background which is the vacuum state. How- 
ever, there are many physical situations where one encounters processes in 
a medium such as, for example, the propagation of particles and scattering 
processes in a gas at finite temperature and density. If the system is in a pure 
state, we can still write the transition amplitude for a scattering process as 


A(i + f) = (F181) (18.1) 


where § is the scattering operator. The evaluation of the matrix element may 
be difficult calculationally, but this does indeed give the answer. However, if 
the initial state is only statistically specified, which is the case for systems 
at finite temperature and density, we are interested in averaging over initial 
states with appropriate probabilities. (This is the probability of the choice of 
a particular state in a statistical ensemble and is not the quantum probability 
of the collapse of the wave function onto a specific eigenstate of the observable 
being measured.) The choice of states in quantum mechanics is given by the 
density matrix p. Consider the averaged expectation value of an observable 
A, given by 


(A) = >= ca (a|Ala) (18.2) 


where |a)’s form a complete set of states, and c, is the probability of finding 
the state |) in the statistical ensemble. Because cg is a probability, we need 
>. Ca = 1. Defining 


p= >a |e) (a (18.3) 
we can write the above formula as 
(A) = Tr (pA) (18.4) 


The normalization of the c,’s translates to Tr p = 1. These two equations 
may be taken as the definition of the density matrix p; in other words, p, 
with Tr p = 1, is an operator which specifies the choice of states by giving 
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the average as in (18.4). Notice that this gives a basis-independent definition 
of p. The choice of a pure state |) corresponds to c, = 1, cg = 0 for a # 4. 
In this case, p? = p, and we may take this as the definition of a pure state. 
If p? # p, we have a statistical mixture of states. 

The evolution of states via the Schrédinger equation leads to 


i. =H pp a (18.5) 


The density matrix p may be considered as the quantum analogue of the 
phase-space density of classical statistical mechanics; equation (18.5) is the 
quantum version of the Liouville equation. 

The density matrix is an operator corresponding to the choice of states. 
The Hilbert space of a quantum system gives all the possible states of the 
system, but when we are interested in discussing a system which has been 
prepared in a particular state (or a particular mixture of states), we need 
an operator to specify this choice. This is done by the density matrix. It is 
somewhat special in that it evolves with time in the Schrodinger picture but 
is time-independent in the Heisenberg picture. If the system under study is in 
a pure state |a), 4 = |a)(a|. The probability of finding the system in a state 
|b) is then given by |(a|b)|?_ = Tr(papp). If we start with a state described 
by pa(t) at time t, it evolves into pq(t’) by a later time t’ according to (18.5); 
the probability of observing a state given by p» in this evolved state is given 
by 


P(a = b, t,t’) = Tr(pa(t’)pv(t)) 
= Tr(e~*#(—#) 9, et #(t'—4) 9.) (18.6) 


This may be interpreted as the scattering probability for a state |a) at time 
t to evolve into |b) by the time t’. As t — —oo, t’ — +00, this is the abso- 
lute square of the S-matrix element. It is possible to rephrase many of the 
calculations we have done in terms of the density matrix. 

Since the time-evolution of p is given by a unitary transformation, it is 
possible to write an action for the density matrix which leads to the equation 
of motion (18.5). Suppose we start with po at time t = 0. Let U be an 
arbitrary unitary operator on the Hilbert space of the theory. The action is 
then given by 


S= / dt [in (cout 0) - Tr(poU" HU)| (le) 


The entire dynamical information of the system is in U, the density matrix 
at time t is given by p = UpoU'. The variation of the action (18.7) is given 
by 


6S = / dtTr uni + Upol tH — U pout | 
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=| dtr |(-z = Ramee ott) ( (18.8) 


where £ = 6U U! and we have done a partial integration. We see that the 
extremization of the action (18.7) by varying with respect to U leads to (18.5). 
It is possible in some circumstances to use (18.7) as an effective action in a 
quantum theory by writing U in terms of the subset of operators which are 
important in the chosen kinematic regime. 

Consider now a dynamical system in thermodynamic equilibrium. At equi- 
librium, p must be independent of time and so we must have [H, p] = 0. The 
general solution is that p should be a function of operators which commute 
with the Hamiltonian; in other words, it should be a function of conserved 
quantities. If we consider a number of such quantities which are additively 
conserved, it is easy to fix the form of this function as follows. Consider 
two completely independent systems J and IJ. The Hilbert space of the two 
systems together is the tensor product of the corresponding Hilbert spaces, 
H =H; © Hr, and the probability of a particular state will go as the prod- 
uct of individual probabilities; therefore p = p; - pry. If O is an additively 
conserved quantity, we have O = O; + O7;. Thus p must be an exponential 
function of the additively conserved quantities. The general solution is of the 


form 
i i ze (- 2,80 ? (18.9) 


O; are additively conserved operators, such as energy, momentum, angular 
momentum, charge, etc. Z = Trexp(— }7; G;0,) to ensure the correct. nor- 
malization of p: it is the partition function. @; are constants, to be specified 
by the average values for the conserved quantities for the system under con- 
sideration. Since p commutes with H, we can also diagonalize it, thus writing 
equation (18.9) as 

= V paiag V' (18.10) 


where V is a unitary transformation which commutes with H, correspond- 
ing, for example, to rotations, charge rotations, etc., The parameters in the 
transformation V, as well as the parameters in Pgiag, will serve to define the 
@; in (18.9) and hence the average values of the conserved quantities. For a 
nonrelativistic theory, particle number can be a conserved quantity, leading 
to the familiar chemical potential term 3yN in the exponent of (18.9). In the 
relativistic case, particle creation and annihilation are possible and one can 
only require conservation of appropriate charges. 

If we consider a fluid with local equilibrium, one can take the form (18.10) 
to be valid for small fluid elements. The parameters of the unitary transfor- 
mation are then approximately constant for each fluid element but can vary 
over larger regions. An approximate description of the dynamics is obtained 
by restricting the action (18.7) to just the modes corresponding to these 
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local parameters; this could provide a way of obtaining the hydrodynamic 
description of fluids. 


18.2 Scalar field theory 


For an application of these results, consider a reah sealar field theory. In 
this case, the only conserved quantities are energy, momentum, and angular 
momentum. The equilibrium p is thus a function of these quantities. We can 
bring the system to rest in a particular Lorentz frame and then rotate it such 
that the angular momentum is-in a fixed direction, say, along the third axis. 
Going back to the general situation is accomplished by a general Lorentz 
transformation (including spatial rotations). The form of p is then 


p = V exp(—(§oH —13Mj2) V' ~ (18.11) 


where Mj. = Jz is the third component of angular momentum. V represents 
the unitary transformation corresponding to a general Lorentz transforma- 
tion. Go and lg are real numbers. From the transformation properties of the 
operators, we can write 


p = exp(—(oL#P, —Ig3L4L¥My,v) (18.12) 


where P,, = (H, P;) are the total energy and momentum operators and M,, 
are the generators of Lorentz transformations, including rotations. L4 is the 
Lorentz transformation matrix for a vector; the parameters in L‘ arise from 
parameters in the unitary transformation V. Lg can be identified as the four- 
velocity of the moving medium, $13(L{L¥ — LYL4) is related to the angular 
momentum of the medium, with the parameters of the Lorentz boost trans- 
formations in L4 determining the velocity and the rotation parameters in L# 
determining the angular momentum. By evaluating the average expectation 
value of energy, one can identify Go as the inverse temperature in the rest 
frame of the medium, Jo = Ty” ae 

In the rest frame, and in the case of zero angular momentum, the partition 
function is given by Z = Trexp(—{)H) = Trexp(—GH). We will drop the 
subscript on § from now on for simplicity of notation; all quantities below 
are in the rest frame of the medium. For a free field the partition function is 
easily evaluated as follows: 


Z = Trexp(—( Sy wpal ag) 
k 
= iT Try exp(—Gwxalax) 
k 


= JE (1 +e Pee + e~28% 4...) 
k 


1 
= Il [arene (18.13) 
k 
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Here Tre involves the sum over states at fixed value of k. Since i commutes 
with alar, it is clear that Tr alalp = Ty axapp = 0; further Tr al app = 0 for 
k # p. The average occupation number n, is obtained as 


er = (al ax) = a5 log Z 
= ee (18.14) 
The average value of the Hamiltonian is given as 
= 1¥ ( Sone) 
k 
a) oe (18.15) 


(Q7r)3 ehwre — J 


We have taken the large volume limit in the last step. This is the standard 
Bose-Einstein formula, as expected, since the scalar field describes bosons. 
The propagator for the free scalar field can be evaluated, using (al.a1) = 


nr Ort, (al.at) = (azar) aallmas 
G(x, y) = Tr[p T ¢(z) om 
= = O(n Les (273 x -|a a mye tee) 8 npei*e— 1 


—ik(2— ” etk(a— y) 
a em Qw,V =e 2W,V 


+ a a ie ote) 


= i =. ane + np 2r 5(k2 —m | e~t(z—¥) (18.16) 


)4 | k? —m2+ 


In addition to the standard zero-temperature contribution, we have a term 
proportional to the occupation number. The propagator gives the amplitude 
for detecting a particle at the spacetime point x if a particle is introduced 
into the system at y. The first term describes the probability for the particle 
to propagate to y from xz; however, there is also the possibility that the 
detected particle is from the background distribution, this is accounted for 
by the second term. 

The temperature-dependent propagator can be used for calculations in- 
volving scattering, corrections to parameters, etc. For example, we have seen 
that the mass correction for a \¢*-theory can be given as 12AG(a, x). Using 
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the propagator in (18.16), the temperature-dependent part can be evaluated 
as 


dk 2, 
PH —<—<——— 
ém* = 12a | (Qn)Quy, oP 1 
= AT? T(a) - (18.17) 


where a = m/T and 


(a) = 3 [a= art. 
On fu? Fa? eve? — 1 


As the particle propagates in the medium, it can interact with the background 
distribution of particles. One consequence is that the effective mass of the 
particle is increased. At high temperatures, Z(a) ~ Z(0) = 1 and 6m? = AT?. 


i (18.18) 


18.3 Fermions at finite temperature and density 


The creation and annihilation operators for fermions obey anticommutation 
rules. We shall consider the case of relativistic fermions, with creation and 
annihilation operators ai, az,r for the particles and bes by, for the an- 
tiparticles. Generally, we also have the conservation of fermion number which 
would be proportional to the electric charge for electrically charged particles. 
For free fermions, we can write H = De eelpaies ar Ed}, ,dk,r) and the 
conserved fermion number is N = Be (Gh ie - bjmbie;). The partition 
function is 


Z = Trexp[-A(H — wN)| 


= Trexp |—£8 S "(Ex = Hah, .Ak,r = So (Ex ar 1)dj, .dk,r 


kyr kr 
= ||P emexp |-A(E - a}. .a%..| ] | Tp.s exp [-B(Ep + 4) b, .bp, | 
kyr p,r 
= I] (1 a age Il ¢ ae ese) ™ (18.19) 
kr P,s 


Trx,r denotes the trace over occupation numbers at the fixed value of mo- 


mentum, k, and for fixed spin r. The average occupation numbers are given 
by 


i 
% @B(Ex—v) 41 


= _ at he it 
Nk Ors = (bj, pbk, s) = Or,s eAEntH) +1 


Nk Or,s =a (al. .ax,s) = br 


(18.20) 
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As expected for fermions, we have the Fermi-Dirac distribution. For positive 
Ht we have a net excess of particles over antiparticles. The average value of 
the Hamiltonian is 


(H) = Tr(o H) 
3 
=>> Vo Ex (ng + fix) (18.21) 


The propagator for a Dirac particle can be evaluated as 


S(x,y) = Tr [p T(x) h(y)] 


= [6(2° — y°)S, — A(y° — 2°) S<] (18.22) 
where 
8, = f FPF [la —npdey-pt mye + fgly-p— mee 
(27)3 2E, i 
Se = / eo n (y rs pt m)e—*P(=—9) + (1 —7y Vy -p— m)e'P(2—¥) 
(Qin)? Qiy|. ” ss 


(18.23) 
These expressions may also be combined and written as 
dp yoptm , 
ar eae) 
eal dla (Qm)4 p? — m? + ie 
ap 1 
(2%)2@E, 


[—rp(y -pt+mye~P?@—Y) + Ain(y-p— m)er(e—v)] 
(18.24) 


We see that the propagator again splits into the standard zero-temperature 
part and a part which depends on the distribution functions. 


18.4 A condition on thermal averages 


Since the Hamiltonian is the operator for translations in time, e~°” can 
be interpreted as shifting the time-arguments of operators by an imaginary 
amount —i3. This can be very useful in certain calculations as well as for a 
functional integral version of the partition function. For an explicit formula, 
consider the average of the product of two operators A(x) and B(y) given by 
Tr(e-8 A(x) B(y)). By the cyclicity of the trace, we may bring B(y) to the 
left end of the expression inside the trace and write 


(A(x) B(y)) = Tr(e~** A(x) B(y)) 
Tr(B(y)e~8" A(z)) (18.25) 
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From the time-translation property of H, B(y)e~°% = e~®7 B(y® — iG, y). 


Alternatively, we can Fourier analyze B(y) in time and write 


Bly) = / dw B(w,y) ev" (18.26) 
The Heisenberg equation of motion then shows that , 
[BWw,y), H] =w Bw, y) (18.27) 


Using this result, we can write 
B(y)eP# = fw e FH [eH Bw, ye P| tw? 
= fw e FF Bw, y) e~ Bw ety 


= Cpse fw B(w,y) e (v9) 
= e FH B(y° — iB, y) (18.28) 
Finally, using this condition in (18.25), we get 
(A(z) B(y)) = (B(y° — iB, y) A(z)) (18.29) 


This statement on thermal averages is known as the Kubo-Martin-Schwinger 
(KMS) condition. 


18.5 Radiation from a heated source 


As another application of the concepts we have introduced, we consider radi- 
ation from a heated object. The source is kept at a constant temperature by 
electric currents or some other means of supplying energy; it could also be 
producing energy by itself like a star. The object is in vacuum and radiates. 
The radiation escapes and the source is not in thermal equilibrium with the 
surroundings. We want to calculate the spectrum of the emitted radiation. 
(There are well-known arguments which lead to a Planck-type spectrum if 
one can assume that the source is in equilibrium with the radiation. Such a 
situation is clearly not obtained in most cases, for example, for a star which 
radiates in empty space. Our idea is to treat this problem without further 
assumptions. ) 
The general form of the interaction part of the action is 


Sint = i d‘z eA, J (18.30) 
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Consider a transition from a definite initial quantum state |7) to a final state 
|f). The transition amplitude for photon emission of four-momentum k and 
polarization aN ) ig given by 


(f|J# (x) e***|i) (18.31) 


= —1e 


a 


We take the square of this and sum over final states; the final states corre- 
spond to the states of the photon and the states of the source. We write the 
summation over the photon polarizations as 


Se ea = Pe (18.32) 
Xr 


We have 6) = 0, so that Poo = Po; = 0 and 


kik; 
k-k 


Pj = by - (18.33) 


For the transition probability from the initial state |i) we thus get 


TP =e ae | atetty MOR, GMM) (1839 
uf 


We have used the completeness relation for the final states of the source. 

Since the choice of the initial state is specified only statistically, we must 
average the above result over initial states with p = e~°” corresponding to 
the fact that the source is kept at a temperature G~!. This gives 


(DAP) = 4 [ctedty SR A eH MP, (IMI (W)) 18.35) 
Introducing p*”(w, x, y) by 
(Tea) I"(y)) = f dn et) p(w, 2,9) (18.36) 
we get 
(J*(y)J(a)) = f da 0") p(w, yy 2) 
me / dus (@"-¥) p(w, y, 2) (18.37) 


Equation (18.29) applied to this average gives 
p’#(—w, y, x2) =e” pM” (w, x,y) (18.38) 
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Defining 
AY’ (w, 2, y) = p’4(—w, y, x2) — p’’(w,2,y) (18.39) 
we can solve (18.38) in terms of AM” to get 
Ab” (w, @; Y) 
pe’ (w, @, y) = ro (18.40) 


—— 
The rate of radiation for photons of momenta in a small range of values 
around k can be obtained by substituting this in (18.35) and dividing by the 
total time f dx°. The result is 


x 


(lA? ) 
aD = 


2 A 
ia = / Prdey (=v) P,, AM (wy, 2, y) INpianck 
Wk 


> dk 


dN Ptanck = 
dNptanck is the Planck distribution function. AY” (w,,x, y) depends on the 
nature of the heated source; it is evaluated at the photon energy w, in the 
above formula. If the source has a very large number of energy levels, with 
every possible value of w being equally likely, we get a pure Planck distribu- 
tion for the emitted photons. Otherwise, the radiation rate will have maxima 
characteristic of the source. 


One can also express A¥” in terms of the commutator of currents. Com- 
bining (18.36) and (18.37), we get 


(["(a), Poy] )=— fidw ee) AM (w,zy) (18.42) 
The retarded two-point function for the currents is defined as 


IIR’ (x,y) = —10(2° — y°)( [J*(z), J”(y)] ) (18.43) 


Using the Fourier transformation in time, we can write this as 


Hy (ey) = f del ee) Tw, 2,9) 
dw AM (w, 2, y) 


Meliss 3). 2x wl —w—ie 


(18.44) 


where we have used the result 
iw’ (x°—y°) 
- iw(2°—y°) 6(9 — ee du’ el” 
ie (x° — y°) Se (18.45) 


Equation (18.44) leads to the result 
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! ee) Pp. IT (w', ayy) = fie i ew Py AM (w, 2,4) 
diy 2m [Jay 
ea 
w! —w — ie 


(18.46) 


Comparison of this with (18.41) shows that the quantity in square brackets 
is real, since dr’ and dNpijanck are real. Taking the imaginary part of this 
equation, we then have 


tk-(a2— Vv 1 ike. — 
Im | i ena Pag lie (2.0) =5 / ele (e-v) PAM (w!, @,y) 
zy cy 


(18.47) 

where we used the standard formula 
pees aay d + ind (w’ 18.48 
P20 ae at) Cae 


Using (18.47), we can finally write the expression for the radiation rate as 
Te - 
dr = _ ImIT, (wr, k, —k) Puy dNptanck (18.49) 
where 
TAG k, =k) = if BPadry (ev) ITH’ (w, x, y) (18.50) 


This shows that Im/I}" (wx, k,—k) captures all the material properties of 
the source which can be seen in the radiation. In systems with a large num- 
ber of closed spaced energy levels with roughly equal radiation intensities, 
(1/w,,)Im/T>” (w, k, —k) is approximately constant and this becomes the usual 
blackbody radiation formula. 


18.6 Screening of gauge fields: Abelian case 


In a plasma of positive and negative charges, electrostatic potentials can be 
screened due to the accumulation of oppositely charged particles near any 
test charge. This is the Debye screening effect. A similar effect can occur at 
finite densities of particles. This screening effect can be described as being 
due to an effective mass for the gauge fields, in this case, the electromagnetic 
field. The usual argument for the absence of mass for a gauge field requires 
gauge invariance and Lorentz invariance. In a system at finite temperature 
and density, there is a preferred frame, namely, the frame in which the tem- 
perature is T = G~'. This is the rest frame of the plasma. In this case, we 


410 18 Finite temperature and density 


cannot demand Lorentz covariance of the polarization tensor, we can only use 
gauge invariance. The property of gauge invariance allows for certain (nonlo- 
cal) mass terms. Such terms are generated by the interaction of the photon 
field with the charged particles of the medium. This effect can be obtained 
by calculating the polarization diagram for the photon, not in vacuum, but 
in the background of the medium, using the propagators appropriate to the 
medium. (One still has, as one must have, Lorentz invariance when the overall 
motion of the plasma is taken into account, as indicated after (18.12). ) 

The screening effect can be demonstrated by the simple case of zero- 
mass fermions coupled to the photon. The induced mass term is bilinear in 
the photon field and so the relevant quantity is the two-photon term in the 
effective action I’; it is given by 


’ 


r@) a5 f ate dy Tey A@)S(eu)y-AW)S(y,2)] (08.51) 


(We do not show the coupling constant here, it can be restored by the scaling 
A, — eA, at the end.) Since we do not have manifest Lorentz invariance, 
there is no particular advantage to simplifying this in a Lorentz-invariant 
way. We shall carry out the time-integrations first. Using equation (18.22) 
and carrying out the time-integrations, we get 


3 
re) == f dulk) dete! 


2 (27)° 2Ep 2Eq 

To) gee BE Beet 
T(p, oa erases i are Pe 
fs) a ee eae 


T i 1 Gas - BrQq 
(p',4 memos are Et ce ye a) | (1852) 


where a, = 1— 7p, Bp = Np and 


= (a\t5(4) (he 4 fy OE 
du(B) = On) (k +) a 
T(p,q) = Tr[y- A(k) y-p y- A(k’) ¥-q] (18.53) 


and p = q +k in equation(18.52). Further, p’ = (p°, —p), q’ = (q°,—q) and 
Po = Ep = |p|, go = Eq= lal; Np is given by 
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1 i! 


Np = rs are ra = ePVT +] 1 (18.54) 


As an example of the kind of simplification we did, consider the term in 
(18.51) with 2° > y°, so that we get Sy (a,y) and Sc(y, x). In using (18.22), 
we get, among others, a term like 


i = * 
= |, / aa? [ dy? Tr[y- A(k) y-py- A(k’) y-4] 
x,y 4 —oo —oo 
X Apap el(k—P- at ilk’ +p+a)y (18.55) 


We use a convergence factor e®” for the y°-integration which leads to a 
factor —i(k’? + p°® + q° — ie)~!. Further integrations give energy-momentum 
6-functions; one of them implies k’? = —k°. This leads to the denominator of 
the apQ@p-term in (18.52). One of the other 6-functions implies gq = —p+ k. 
We then make a change gq — —q so that we may write p = q+k. This gives 
the coefficient T(p,q’), where g’ = (q°,-q) = (E,,—q). The other terms 
simplify in a similar way; we get q’ and p’ in some terms when we bring them 
to a form where we have p = q+ k for all terms. 

The convergence factors et(2°—-v") were put in appropriately to do the 
time-integrations; these appear as the ie-factors in the denominators. The ie’s 
can be taken to go to zero at this stage. They can contribute to the imaginary 
part. Here we are interested in screening effects which are described by the 
real part of the two-point function, so we shall not need them. Also, the 
relevant imaginary part, for most physical situations, is that of the retarded 
function which is not directly given by the above time-ordered functions in 
(18.52). The correct imaginary part can be obtained from the expression 
for the real part by the prescription kj — ko + ie for the external photon 
momentum k. 

The real part of the medium-dependent part of I’) is given by 


_\ (p,q) 2 T(p', 4) 
(Mp + Ta) gO Me + Mo) 
ny) 2w9) 
+s tn) 5 —E, +k (18.56) 


The screening effects pertain to the long distance behavior of the fields; 
equivalently, a mass term is important at low values of k. For the nonvacuum 
contribution, the average value of the loop momentum q in (18.52) is given 
by the temperature T or the chemical potential 4, depending on which is 
larger. In extracting the screening mass term, we may therefore simplify the 
expressions by taking |p|, |q| >> |k|, so that E, — Ey —k® ~ —k-Q, Ey— Eq+ 


A412 18 Finite temperature and density 


k°o = k-Q!, Ey + Ey +k° = 2E,, where Q = (1,q/Eq), Q’ = (1,—4@/E)- 
(Notice that Q and Q! are lightlike vectors, Q? = Q’? = 0.) Further, 
T(p,q) © 8EZA1-Q A2-Q 
T(p',q') ~ 8E7 Ay -Q! A2-Q’ 
T(p',q) ~ T(p,q') = 42 (Ar - Q’A2 Q + Ai .QA2: Q! — 2A; - Aa) 
(18.57) 
where A; = A(k), Az = A(k’). 


The difference of the distributions can also be approximated as np —ng ~ 
(dn/dq°)Q -k and the — result can be used. 


A iq sp f() = os aS(Q) (18.58) 


for any function f of Q or Q’. (For us f will be terms from T(p,q) and 
T(p’,q').) We can further use 2Q-k = k-Q’ —k-Q. Expression (18.56) can 
be simplified using (18.57) and (18.58) as 


r@ =i 5 [awe [ ene [EB(oa- QA, C= aee sao 


k-Q! 
k-Q 


n / , k- 
— 7 2A -Q’A2-Q re a (18.59) 


ARQ AOE A: | — 2a OARO 
E, 


The angular integration in (18.59) over the directions of g (or Q) help simplify 
it further by virtue of 


/ dSU2A4-Q Ao-Q—-At-OAs-O'=Ay-O Ape a / d2(2A1-QA2-Q’) 


(18.60) 

Defining 

A- -Q! 
we can write (18.59) as 
roo} foun [B 
= k- k- 
f(A, TY, ni) —r6 Ay4 A2_ (n + n) —n a —_— nA Aa. | 

(18.62) 


At high temperatures, i.e., T >> ps, we also have Np © Ny. The integral 
can be evaluated in this case to yield 
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1 T2 k-Q’ k- 
Po = / dus(k) (=) iN dQ [2A An — FS Ay Ang 2 Ay Ao] 


(18.63) 

The other limit of interest is at high densities corresponding to a degener- 

ate gas of charged fermions; in this case, T < yp. As T/p — 0, the antiparticle 

occupation numbers fi, — 0 (for positive yz). In this case, (18.62) simplifies 
to 


il 2 k- OQ’ ; 
pli 5 [09 (4) [eo [2A An — FS Ans Any pe AL Aa | 
(18.64) 
and we have taken the limit of T — 0 (small compared to pz). Notice that 
screening at finite density occurs even with one type of charge, because charge 
fluctuations above the common background charge are what are relevant. 
We can discuss plasma oscillations using this screening term. By adding 
I’) to the Maxwell action, we get an effective action which will incorporate 
some of the effects of the matter distribution at finite temperature, 


Ses =-73 i Be) (18.65) 


This is a low energy effective action since we have made a long wavelength 
approximation in simplifying it. Long-wavelength and low-frequency plasma 
waves are the classical solutions of the effective theory (18.65). It also in- 
cludes effects such as the screening of Coulomb fields. In terms of the Fourier 
components of A,, we can write 


Sup = f Ay(-KM™ (ALB) a (18.66) 


where 


MY = (- _ kent” + eke) + 2 larntn v0 - fan; dQ 


oo 0"0" (18.67) 


k-Q 


k# My, = 0 in accordance with the requirement of gauge invariance. We have 
restored the coupling constant e by the scaling A, — eA,. We now split A, 
into a gauge-dependent part and gauge-invariant components as 


Ay = kypA(k) + Gu + &u (18.68) 


where A shifts under gauge transformations and ¢,,, , are gauge-invariant. 
¢, € correspond to the three directions orthogonal to k,, and they can be 


parametrized as 
ae 
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&=eMa,, A=1,2 (18.69) 


The e;’s form a triad of spatial unit vectors. If needed, a specific choice, as 
indicated in Chapter 6, would be 


(3) kj 
e°) Seo =o 
Vk? 
ie 
ef) = €ij = ,0 
V kp 
| 2 
Be = k3k; a xa , += 1,2; (18.70) 
kee? VR 


where k2, = k? + k3. Notice that ke) = 0, \ = 1,2. ¢ and ay are the 
gauge-invariant degrees of freedom in (18.68). When the mode decomposition 
(18.68) is used in (18.66), we get 


k? 
cr i, OP + EF (18.71) 


Sy = 4 f — é,(—k) (Fost) M7 (k)E;(K) + 6(-K)M*(k) 65) 


where 


272 k2 k2 k 
MT (k) = kj — k? ~ — | a+(-3) 1 | 
2772 k 
Mo) = 24 ee 
(k) + = c IKE 
ko +k 
ko —k 


L=log ( (18.72) 
We have also included an interaction term with a conserved source J, in 
(18.71); i.e., we include f A,,J“ and simplify it using (18.69). From (18.71) we 
see that the interaction between charges in the plasma is governed by (M¥)~, 
which shows the Debye screening with a Debye mass mp = 1/e?T?/3. The 
action (18.71) can also give free wavelike solutions. The dispersion rules 
for these plasma waves would be M? = 0 for the transverse waves and 
(k? — k2/k?)M¥” =0 for the longitudinal waves. (The extra factor multiply- 
ing M_, is from rewriting ¢(k) in terms of the potential.) 
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18.7 Screening of gauge fields: Nonabelian case 


We have considered the screening of Abelian gauge fields in the last section. 
Here we will extend those considerations to the nonabelian theory; this is 
relevant for the quark-gluon plasma, for instance. Consider the term in the 
effective action I’[A] which represents the screening effect. The term quadratic 
in A’s is what we have calculated. In the nonabelian case, the structure of this 
term remains the same, except for the trace over the Lie algebra matrices. 
The prefactor also changes since the gauge bosons (or gluons) themselves can 
propagate in the loop and contribute to the two-point function. But there are 
also many higher point functions which are nonzero and which are needed 
for reasons of gauge invariance. Screening is an infrared or low-momentum 
effect and so we are interested in the simplification of loop diagrams when the 
external momenta are small compared to the loop momentum. The average 
loop momentum is of the order of the temperature T’, since it is controlled by 
the statistical distribution function. One can then establish a set of power- 
counting rules, applicable when the external momenta are small (of the order 
of eT), and isolate the relevant diagrams. They will be one-loop diagrams 
with the loop momentum of the order of J’ and external momenta of order 
eT; they are called hard thermal loops. Rather than calculating such diagrams 
directly, we will use an argument based on gauge invariance to get the result. 
There are two key properties which are important to our analysis. 


1. I'[A] is gauge-invariant with respect to gauge transformations of the 

gauge potential A, and is independent of the gauge-fixing used to de- 
fine the gluon propagators. 
We can understand how the kinematics of hard thermal loops can lead 
to gauge invariance. The thermal propagators satisfy the same differen- 
tial equations as the zero temperature propagators. This is evident from 
(18.16) and (18.23). The extra term involving the distribution functions 
obeys the homogeneous free particle equation of motion, so the propa- 
gators differ from the zero temperature ones in the choice of boundary 
conditions only. As a result, the generating functional of one-particle 
irreducible vertices, viz., I’[A,c,é,...] obeys the standard BRST Ward- 
Takahashi identities. Keeping only the part of I relevant for external 
gauge fields, we thus have the identity (10.123) 


ee ee 18.73 
5Ag 6K 5L85ce “64 5c Se 


fe E or oral ~.. olor 
In the hard thermal loop approximation, terms involving the ghosts are 
subdominant. Recall that, with a gauge-fixing term ~ (0- A)’, the ghost- 
gluon coupling involves f7°°A*“ (0,,@°)c°; the derivative is on the antighost 
field. In a diagram with external ghosts, this is a power of external mo- 
mentum and therefore such a diagram is smaller compared to a similar 
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diagram with the ghost lines replaced by gluons where there are contri- 
butions with all derivatives on the internal lines which give powers of T. 
The terms or , oF i in (10.123) are thus negligible in the hard thermal loop 
approximation. Further, we are interested only in one-loop terms. The 
identity (10.123) then gives the gauge-invariance of I’. Thus, effectively, 
in a high-T-expansion, the leading term I[A], which is proportional to 
T?, is gauge-invariant. Since the thermal contribution to the propagator 
is on-shell, the T-dependent part of a one-loop diagram is classical and 
so it is not surprising that the BRST Ward identities reduce to the state- 
ment of gauge-invariance. The fact that I’ does not depend on the gauge 
choice for the gluon propagators can be seen by similar arguments; iden- 
tities for the variation of [ under changes in the gauge-fixing function 
can be written down and simplified as in the case of (10.123). 
2. I'[A] has the form 


= (N+4Ny,) a [fas anAgag+ [ a0 w(A-Q)| . (18.74) 


This is a restatement of the result (18.63) in the form used in (18.67). The 
angular integration is over the directions of Q. Here we are considering an 
SU(N) gauge theory and N; is the number of fermion flavours. The key 
point about this equation is that, for each Q, I"[A] involves essentially 
only two components of the gauge potential, Ag and A-Q. The df2- 
integration will bring in all components of the potential, but for each 
Q only two components are needed. In the explicit calculation of the 
two-point function, Q was the angular part of the loop-momentum gq. 
The d{2-integration in (18.74) is over the orientations of Q and is the 
unfinished part of the loop integration, after the integration over the 
modulus of the loop momentum has been done. The structure of (18.74) 
can be seen to be general by analysis of diagrams again. For example, for 
diagrams with the derivative gluon coupling due to the interaction term 
Hf ae Aba AS, since p- A = q- A, the possible tensor structures are q? A? 
and Ge. A)?. The former is zero since the thermal part of the propagator 
involves the 6-function 5(q?). Writing q = |q|(1,Q) and carrying out the 
|q|-integration, we are left with a structure like (18.74). This argument 
generalizes to diagrams with an arbitrary number of external gluons. 


Given these two properties of I one can determine W and hence I" simply 
by the requirement of gauge-invariance. First of all, define two derivatives 
Oa 3Q -O and 0_ = $Q’ - 0 in a way analogous to our definition of A+ in 
(18.61). The condition for gauge-invariance of I" is then 


/ d0 (|p. (GE)) * 4d 45 = (18.75) 


Using the identity 
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Pee / dQ 8o(Q- A*) =2 (0, + 0-)A% (18.76) 
we see that the condition of gauge-invariance (18.75) is satisfied if 
ow 
oi || — ee eee ae (18.77) 
SA, 
Further, if we define A_ by 
OW _ . 
saa = —2 A® 2 At (18.78) 


equation (18.77) becomes 
D:A_ — 0A, =0 (18.79) 
This is similar in form to equation (17.53) of the WZW theory, 
0;A,-—0,A + [A, Az] =0 (18.80) 


Equation (18.80) was obtained for the WZW action in Euclidean space. In the 
present case, it is easier to define a WZW action with Minkowski signature. 
If we are in two dimensions, with 04 = $(do + 01), 


Swaw(U) = == / dx Tr (0,U0_U-) 
1 


+ dime” "Test Uso So.) 
127 M3 


(18.81) 
The Polyakov-Wiegman identity for this WZW action is 
Hi 
Swzw(hU) = Swzw(h) + Swzw(U) — r= je Tr@'e_h 0,UU—*) 
(18.82) 
Further, if we define A, = —0,UU—, we find 
1 
6Swaw(U) =—= | dx Tr(A_6A4) (18.83) 


This is the analog of (17.51). 
In our four-dimensional case, we note that 0, = $Q - 0 is invertible with 
8G(z,y) = 6 (x —y) 
d*p eiP(e—y) 2 
(27)4 ip-Q 


Thus it is indeed possible to define a unitary matrix U such that 


G(z,y) = (18.84) 
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(Notice that one can, at least as a series in A, solve this equation for U.) The 
equations of interest to us have no derivatives with respect to the transverse 
coordinates x7, Q-a7 = Q’-a7 = 0. They simply act as some parameters on 
which the fields can depend. We may then use the identity (18.83) to write 
the solution for W as Se 


ie = 2 f de Tr(A,A,) — an [ Ps" Swaw(U) (18.86) 


The WZW action must be integrated over the transverse coordinates, since 
the original WZW action was defined in two dimensions and hence carry 
integration only over the z* coordinates. Equation (18.86) leads to 


= (N+ 4Ns) = | d' 2nA% Ae 


“ i ao{2 / d*z Tr(A,A,) — 4a ii a2" Swaw(U)} 


We can rewrite this in a more transparent way by introducing another unitary 
matrix V such that 


(18.87) 


A_ =-0_V'V=V'd_V (18.88) 


so that U — Vt under Q © Q’. Since we are integrating over all directions 
of Q, f d2 Swzw(U) = f dQ Swzw(V'). Further we have the identity 


In ASAS +2 / dQ Tr(AyA,) = -2 \ dQ Tr(AyA_) (18.89) 


which follows by straightforward d{2-integration. Using these two results, 
we can finally write the effective action which describes screening for the 
nonabelian theory as 


PUA) = (N+ 3Nj) 5—(-2n) if [F(A A) + Sw2w(0) + Swaw V9) 


=—-(N+ LN) - f 27 dQ Swzw(V'U) (18.90) 


where we have used the Polyakov-Wiegmann identity to combine terms. 

The final result for the screening term in the effective action is thus very 
simple and transparent. It is given by the gauge-invariant WZW action for 
fields on the lightcone defined by x° + Q-, the unit vector Q specifying the 
orientation of the lightcone. In general the fields depend on the transverse 
coordinates as well. The WZW action is then integrated over the transverse 
directions and over Q), i.e., over all orientations of the lightcone. 


18.8 Retarded and time-ordered functions 419 


One can derive a similar result for screening for a fermion gas at high 
densities and low temperatures. We find, in a way analogous to (18.64), 


i i 
a Ai = Mie ih Px dQ Swzw(V'U) (18.91) 


where p is the chemical: potential. (In these equations, the gauge potentials 
are the antihermitian matrices A, = —it* All|, whereas we used the hermitian 
components in (18.64) since that is more conventional for Abelian fields.) 


18.8 Retarded and time-ordered functions and the 
Kubo formula 


In the last two sections, we used time-ordered products to calculate the real 
part of the two-point vertex function for A,,, and more generally the screening 
term in the effective action, which can also describe plasma wave propagation. 
However, for the imaginary part, we should use the retarded functions to get 
physically relevant results, as we shall explain below. : 

In scattering theory we ask the question that if we start with an essentially 
free particle state |i) at some time, what is the probability amplitude to 
observe the (essentially free particle) state |f) at a later time. This involves 
time-evolving the state |i) and then finding the overlap with the state |f). 
Since time-evolution is a naturally time-ordered process, we get the time- 
ordered products in the S-matrix. Notice that the state |f) evolves as in the 
free theory. If we use the Schrédinger picture, the amplitudes are therefore 
of the form (fle*#°'e-*#"|i), where Ho is the free Hamiltonian and H is the 
full Hamiltonian. If we define 


a eect) (18.92) 
we find 


iF = e*Ho(t—to) 7. et (t—to) 

= int (Qin) U (18.93) 
The fields din in the last step are defined by din = e*#o(t—to) pe—tHolt—to) | 
Therefore they evolve as free Heisenberg fields. Integration of equation (18.93) 
leads to the formula (5.72) for the S-operator 


$ —T eidint (din) = U(oo, —oo) 

2? 
U(x°,y°)=T exp / 
y? 


at least as long as we do not have derivative couplings. (With derivative 
couplings, we do not have Hine = —Lint and the situation is a bit more 
involved.) 


ada Lise (din | (18.94) 
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However, if we have to calculate the time-evolution of field configurations, 
then we need to calculate matrix elements in the Schrodinger picture where 
both states involved evolve with the Hamiltonian H; i.e., we need 


() = (2° = Ole" ge-*H2" [2° = 0) (18.95) 


or equivalently, we need to solve the Heisenberg equations of motion for the 
Heisenberg field ¢(x°) = e*#*° ¢e-##*". The question we are asking is, given 
the initial field operator, what is the field operator at a later time? The 
solution involves the retarded Green’s functions as in (5.81). Similarly for 
electrodynamics, if we apply a perturbation to the medium via the field A,, 
we can ask what the induced current is; this is the relevant quantity for the 
conductivity of the medium. In these cases, unlike in the case of the S-matrix, 
the final states are not specified. 

In the case of a scalar field, the equation of motion was obtained in equa- 
tion (5.77) as ; 
(+ m*)4(z) = p(z) ) (18.96) 


where the source p(x) was given in equation (5.83) in terms of the scattering 
operator by 


act wos 
asc) 
dSint (¢) 
= —_—_——_ 18.97 
dO(y) ee 
The analogous equations for electrodynamics are easy to set up and read 
Res: 
OF 4s) 1S : 
Gis 5A,,(a) (18.98) 
where the current J" is 
es 
J* (a) =iS~* 
(2) 35 5A, (2) (18.99) 


(While no gauge choice is specified in (18.98), it can be fixed, for example, by 
adding —4(0- A)? to the Lagrangian, whereupon the left side would become 
DA.) The scattering operator S is given by T explie f d*zA,,j"], where j4 = 
Winy' Win, is the current with the in-fields. 

We can use (18.99) to obtain J“ as a power series in A,,. In particular, a 
Taylor expansion of (18.99) to linear order in A gives 


J#(a) = ej*(x) — ie? / d*y 0(a° —y°)[j#(a),3”(y)|Av(y) (18.100) 


Using this in (18.98) and taking the average with the density matrix, we can 
write 
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a,F*M(a) = f dy Tp (2,y) Avy) (18.101) 


where 
TTR (x,y) = —te” O(x° — y°) ([j*(a), 3” (y)]) (18.102) 


The angular brackets denote the averaging, with the unperturbed density 
matrix, so that (j”) vanishes. Thus the response function (18.101), namely, 
the average of the retarded commutator (or equation (18.102)), is appropriate 
to the situation where we perturb the medium by the field and ask how the 
field evolves. Equation (18.102) is known as the Kubo formula and has been 
used extensively in condensed matter physics. 

Consider now the calculation of this for the thermal plasma. After Wick 
contractions and using the thermal propagators (18.22,18.23) and carrying 
out the time-integrations with convergence factors as before, we find 


d*k 


IT*” (x,y) = are PMR) (18.103) 
dea. lel 
BY (pL) — ee 
TOY) = | an)8 2B, 2B, * 
py Op Bq s, Og Bp 
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jv OpQg hp Ba ee cae oo) 
a ate, (are es +E,—ko ie Ep+E,—ko —ie 
Apa Bo, 
pv pq pq 
sie 0( gre ere Peete Epa ae! 
pv ff Op Bg pq =) 
ey (a—eetE =F eet ic Ey Eq ieee te 
(18.104) 


where T#" (p,q) = Tr(y4* y-p 7” 7-4) and p’* = (Ep, —p), q'# = (Eq, —@) 
and p=q-+k. 

Notice that equation (18.104) is po what we got from I?) except for 
the changes in sign of some of the ie’s. The retarded function (18.104) can be 
obtained by continuing the real part by the rule kg — ko + ie. The real part 
of IT#”(k) is the same for both the time-ordered and the retarded functions. 


Using the formula , 


z—teE 


= P= + ind(z) (18.105) 


we now calculate the imaginary part of the retarded function (18.104) for the 
momenta of the gauge fields small compared to the temperature. 

The contributions due to the T#”(p, q’) and T’”(p’,q) terms in (18.104) 
are subdominant for k small compared to T because their imaginary parts 
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carry the 6-functions, 6(E, + Eq + ko), which have no support as k — 0. 
The dominant contributions to the imaginary part of J7#” are due to the 
T” (p,q), T#”(p’, q') terms and give 


dq 1-1 
jav = pv i rs) a —_ k 
ImJIp nf (27)3 2E» 2Eq Ir (p, q) (Ng Np) (Ep Eq 0) 


Sl cia maras 


—T*” (p',q') (tig — Tip) 6(Ep — Eq + ko) (18.106) 


For |k| small compared to |p|, |q|, the latter being of order T, we have 
the same simplifications indicated after (18.56) and in (18.57). With these, 
equation (18.106) can be simplified as 


koe aa. 
hn ; 5 : dq q(q + fig) 
T 0 
kot? isp 
=P | (18.107) 
where 
pry = / dO 5(k-Q)Q"Q” (18.108) 


and we take n = n. The integration in (18.108) is over the orientations of the 
unit vector Q = q. P*” is transverse and traceless, while the é-function has 
support only for spacelike k. Carrying out the angular integration 


pH — -#4(-8) [pt + 5P| (18.109) 
where 
Ds G 5 =) - (18.110) 
and 
PS 0 
PH) = 64 — S ; (18.111) 


Here 0(—k?) is the step function, 6(—k?) = 1 for k? < 0 and 6(—k?) = 0 for 
k? > 0. 
18.9 Physical significance of Im IT2” 


We have already seen that in the case of radiation from a heated source the 
retarded function naturally emerged as the important quantity. From the 
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above discussion it is also important for the time-evolution of field configu- 
rations, the induced current, etc. In the plasma context, the imaginary part 
of IT,” describes Landau damping, which can occur for fields with spacelike 
momenta; this gives a direct interpretation for IJ”. Explicitly we consider 


A, (x) = a ert ATK) ~ (18.112) 
with 
A,(k) = 6(ko — w(k)) Ay (k) + 5(ko + w(k)) Al (k) (18.113) 


and w* < k*. The amplitudes A,,(k), Al,(k), respectively, of the positive- and 
negative-frequency terms, correspond to absorption and emission processes. 
The decay of the field in the plasma arises from absorption by fermions and 
antifermions. The amplitude for absorption by fermions is given, to the lowest 
order in coupling constant, by 


A = itip(y « A)uqg(277)*6™ (p — q — k)4/19(1 — np) (18.114) 


where up, Ug are wave functions for outgoing and incoming fermions, re- 
spectively. The factor ,/ng(1 — np) can be interpreted physically as follows. 
The initial fermion is chosen from a state of occupation number ng, which 
has a probability amplitude ,/ng, and the final fermion is scattered into a 
state of occupation number n, which carries a factor ,/1 — np in accordance 
with the exclusion principle. Absorption of a single quantum, as in (18.114), 
is kinematically allowed for spacelike momenta; it is in fact the inverse of 
Cerenkov radiation discussed in Chapter 7. One can also have creation of the 
mode (w,k) by (Cerenkov) radiation from fermions, given by a formula like 
(18.114) with p< q and A« Al. There are also similar contributions from 
antifermions. For the net absorption probability per unit spacetime volume, 
denoted by yy, we then find, with summation over all fermion states, 


34 ae al (k)T*” (p,q) Av (k) 


T” (p,q) = 2n6(Ep — Ey — ko) TH” (p,q) (Ng(1 — Mp) — p(1 — 4) 
+276(Ep ae ko) eS ps, q’) (mp(1 = Tq) — Ng (1 =< Tip) 
(18.115) 


Y= 


Comparing this with (18.106), we see that 
7 = 2Al (k) [ImJT8"(ko)| Av(k) (18.116) 


In terms of the parametrization of the potentials (18.68) and (18.69), we 
can write 


yam |i olo+- 5 (1 -=) A} Ar] (18.117) 
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where w = |k|. 7 is manifestly positive, as expected for net absorption or 
damping. 

In summary, we see that the retarded commutator is what is relevant for 
the operator equations of motion, Landau damping, etc. However, because 
of the asymmetry in its time-arguments, the retarded commutator cannot 
be obtained by varying an effective action, which necessarily involves the 
symmetric, time-ordered products. One can use a more general formalism 
where the functional integral and effective actions can be defined with time- 
integration running over a contour that goes from —oo to oo and then goes 
back from oo to —oo. This is applicable not only to near-equilibrium situations 
like the ones considered so far, but forms the general theory of nonequilibrium 
phenomena in quantum field theory. This approach will be discussed briefly 
in the next section. 


18.10 Nonequilibrium phenomena and quantum kinetic 
theory 


In classical physics, the Boltzmann equation gives the time-evolution of the 
one-particle distribution function. More generally the evolution of a statistical 
distribution is given by the Liouville equation. Since this is very hard to ana- 
lyze in full generality for a many-particle system, one makes the approxima- 
tion of truncating to the one-particle, or in some situations the two-particle, 
distributions. The description of nonequilibrium phenomena via kinetic the- 
ory is developed from these. We will now discuss briefly the quantum field 
theoretic approach to kinetic theory. 

The statistical averages of interest are calculated with the density ma- 
trix p. For a scalar field ¢, the one-particle operators are of the form 
Mes i o(z)M (x, y)do(y), where M(z,y) is independent of ¢. The aver- 
age is thus given by 


TrMp = i M(z,y)Tr[4(y)0o(2)] 


= | M(z,y) Fly,2) 


zy 


F(z,y) = Tr[4(2) p d(y)] (18.118) 


F(z, y) is a one-particle distribution function. Similarly, we can define a whole 
set of higher point expectation values, appropriate for many-particle oper- 
ators. For kinetic theory, we need some method of calculating such func- 
tions, or a set of equations for them. Notice that for y° > 2°, F(x,y) = 
Tr[pT (¢(y)e(z))] = (T'o(y)d(x)) = Gly, x). One can obtain it in terms of the 
propagator calculated in the medium. However, for x° > y° we cannot obtain 
F(az,y) in terms of the time-ordered functions. So we define the functional 
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~ bad af a —oo = 
Z| J, J] = Tr [rel * npiPed “4 (18.119) 
where T denotes time-ordering as usual and T denotes antitime-ordering, i.e., 


a —op =; ‘t ay F acre Te < 
Tela —_ ae taba © 50, a 4 (18.120) 


The expression for Z[.J, J] in (18.119) has the free density matrix po; the effect 
of the interactions is carried by the fields, since we are in the Heisenberg 
picture and the fields evolve according to the equations of motion of the 
interacting theory. It is now clear that we can write 


1 6°Z 
F(z, y) = E ee a (18.121) 


Higher correlators of ¢’s needed for multiparticle operators can also be ob- 
tained from (18.119) by further differentiations with respect to J, J. By 
cyclicity of trace we may write (18.119) as 


Z[J, J) = Tr IToed 19] = (Toe$ 1%) (18.122) 


where C denotes a contour for time-integration starting from —oo, going to 
oo and then returning to —oo. J’s on the return branch are J. If there are 
several fields invloved, we include a source function for each field. One can 
define Z by a functional integral, or by functional equations, or their partially 
summed versions, the Schwinger-Dyson equations. These equations form a 
hierarchy of equations for various multiparticle functions which completely 
characterize the theory. For example, let ¢ obey the equation of motion 


(Oo, +m? + U(2)|¢(z) — p(x) =0 (18.123) 


where p(x) depends on ¢’s and other fields which may couple to ¢. (This 
p(x) is the source for the field given in (5.83) and (18.97), not the density 
matrix; we will not use the density matrix in what follows.) We also include 
a background potential U(x) for generality. The equations of motion for Z 
are then given by 


6Z : ‘ 
A577) — p(1) L= —iJ(1) Zz 
ae. oe 


where K, =O, +m? + U(2). A(1) stands for p with the fields replaced by 
derivatives with respect to sources and / involves derivatives with respect to 
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the tilde-sources. As an example, if the interaction term is Sint = A i) ¢*x for 
two scalar fields ¢, x, then p = 2A¢y and 


: ) rf) 

A(l) = 20571) on(1) 

$19) <9) ee 

A) = Dee ey (18.125) 


where 77 is the source function for x. 

By taking derivatives of the first of equations (18.124) with respect to 
J’s and setting J, J to zero, we get equations for time-ordered correlators; 
from the second equation in (18.124) we get equations for antitime-ordered 
correlators; and both lead to equations for F(z, y) and its multiparticle gen- 
eralizations. At the simplest, one-particle level we find 


K,F(2,y) = (2) =~ Br an 


K,F(z,y) = (18.126) 


Te ah) Z 


We will now examine the structure of the terms of the right-hand sides of 
(18.126) in some detail. Taking the example of Sing =  f ¢*x, and simplifying 
along the lines of deriving the Schwinger-Dyson equations in Chapter 8, we 
find that we can write 


a 6 . * 
Ae) Siey 2x4 [ B2)F 0) - £0,296", 9) 


sree) — / [G(a, z)E(z,y) — F(a, 2z)E*(z,y)| (18.127) 
where 


E(x, z) = 4\* Fy (2,2) F(a, 2) 
3(x,z) = 47G4(z, z)G, (a, z) (18.128) 


From the definitions, we have the result 

G(a, z) = 0(2° — z°)F*(a,z) + 6(2° — 2°) F(a, z) (18.129) 
This shows that we can write 

U(x, z) = 0(2° — 2°)E*(2,z) + 0(2° — 2°) E(z, z) (18.130) 


For more general interactions, the form of the self-energies in (18.128) will 
change, but the general form of equations (18.127) and (18.130) is re- 
tained. Using (18.127) in equation (18.126) and writing out the limits of 
z°-integration appropriately, we get 
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FG@.y) = (aco -_ [ Cage) (F — Fyn] 
el y° 


KyF(x,y) = —-% ‘a (F- F\(e,2€(e)- f Fle sy Ele) 


—oo 


(18.131) 


These equations are often known as the Kadanoff-Baym equations. They are 
one of the quantum versions of the one-particle Boltzmann equation. 

The Boltzmann equation emerges from (18.131) when further approxima- 
tions are made. First we take the difference of the two equations given above 
to get 


2° 


(Kz — Ky) F(x,y) = -# / Ko =e P(E air") 6) 


- [ e@-r)-Fe-£) 
(18.132) 


We now write F(x, y) as 


F(a,y) = ) = e?@—Y) F(p, X) (18.133) 


where X = $(£ + y). The potential is taken to be slowly varying on the 
scale of the interparticle interactions, so that one can approximate U(x) by 
expanding to first order in the relative coordinate x — y. In this case 


dp |,» OF , OF av 
(ia “OXF Opens 


(K, — Ky) F(z,y) ¥i e'P(7-) (18.134) 


For the terms on the right-hand side of (18.132), we introduce the change of 
variables 


g(t@+z)=X+3(E+¢) 
d(zt+ty)=X4+5¢ 
e-y=€, z-z=C (18.135) 


and expand everything to the lowest order in €, ¢. For example, for the first 
term we get 


2? 0 
/ eer = / dice — E*)(X + 1€ + 4¢,-0) F(X +4640) 


0 
approx f_ BCEPLE ~E*)(X,-0) FIKEO 
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2 d* Pl dt P2 r 
wf ate Beas (E - E)(Xm2) F(X) 
xexplin(€+6)—tpa¢] (18.136) 


In a similar way, we find 


0 


0 
[i re-eye [atc PEE Puce - 2%) 


x exp [tp1(&€ + ¢) — iped] 


oe es ; 
R 4 , 
ff ai (27)4 (27)4 F(X, pi)(E — €*)(X, pa) 


x exp [ipe(€ — ¢) + tpr¢] (18.137) 


The two results (18.136) and (18.137) can now be combined and the integra- 
tion over ¢ and one of the momenta carried out to give 


ile e-e)F+ Fe-e) x | ore E*) F(X, pei 
(18.138) 


A similar simplification for the other two terms on the right-hand side of 
(18.132) gives 


[ve é(F — F*) . (F- FYE = | el P e(F — F*)(X,p)ei” 
(18.139) 
Using (18.134), (18.138), and (18.139), equation (18.132) becomes 


OF OF 08 
2p fH + —- —— 
OX# " Op, OX# 


= CEE to (18.140) 


where the ellipsis indicates that a number of terms, corresponding to higher 
orders in x — y, have been neglected in arriving at this result. This equation 
is the Boltzmann equation; the right-hand side is the collision integral. 

We will now simplify the collision integral to bring it to a more familiar 
form. For this purpose, we will consider the case when U(x) = 0, and also 
do a perturbative calculation. For specificity, we will continue with the two- 
scalar-field case with Sing = A f ¢*x, ¢ and y being fields of masses m and 
M, respectively. In this case, F(x, y) can be expanded as 


= Ok ik-(x—y) Bx —ik-(r—y) 
F4(z,y) = ete (e—v) 4 Ph v 
02,0) = aay 2u,.V~ eye 


= Oe ik-(x—y) Br —ik-(x—y) 
F(x, y) = y — y BOUL y ; 
x(t y) - 2,V° = 20,V° (18 141) 
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where 


a, =1+ 27k, Br = Nr 
Gp = 1+ Nx, Br = Nz (18.142) 
where w, = Vk? +m?, Q, = Vk? + M2. Equation (18.141) is similar to 


equation (18.16), but at this stage we do not assume the Bose distribution 
for n,, N;. From this form of F’, we get 


F(p) = 2n Pe =ikiygs) + res + ¥»)| 
F*(p) = 20 | P2609 — up) + S=P5(H0+u»)] (18.148 


We can use these in (18.140) and separate out an equation for a, by mul- 
tiplying by @(po) and integrating over all po. As for the right-hand side of 
(18.140), we have, to the lowest nontrivial order 


E(x, z) = 4° Fy (2, z)F, (2, 2) (18.144) 
By Fourier transformation, we get 


(2m)* st) 
E(p) = 4)? 5 2 OK, kd (p aa ky oe k2) 
ae QW, 252K, V 


+0, Bk, 5 (p — ky + ke) + Brey Ge.) (p + ka — ke) 


+ Bre, Bead (p + ki + ke) 


(18.145) 


Substituting for F, F* from (18.143), using the above expression for €, and 
taking the limit of large volumes, we get the Boltzmann equation 


pt Oap _ 42? 5 J ier aren 
ap OX" ~ Du, pbk, Jka (phe, Bre Byorr, Gs (2)46 (p — ky — ke) 
a (apBes Ok, a Brotks Br ) (21)46 (p —k+ k2) 
5 (apan, Bre — Bors dks) (21)*6() (p + ky — ka) (18.146) 
te (apc, Ok» = BoBry Bra) (2)*5) (p +k) + kz) 
where 


Peet 2 Oe 18.147 
Opis ka (27r)3 Qu, (277)? 2, ( ) 
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The first term on the right-hand side is the difference of the transition rates 
for Xk. +¢k, > dp and dp > Xky+Gx,- The second term gives the difference of 
rates for dg, “> Xko +p, the third is the similar quantity for xn, <> Or, +Op- 
The fourth term is zero for kinematic reasons. Thus we see that the right hand 
side of equations (18.132) does reduce to the standard collision integral of 
the Boltzmann equation. Equilibrium is obtained when the collision integral 
vanishes, so that a» is independent of time. From the explicit form of (18.146), 
we see that this requires the conditions 


Op Bk, Bks ia Bp Ok, Uke oe 0 
Ap Bk, Xk, — BpOk, Pr, = 0 
OpAk, Bo — BpBk, Ak, = 0 . (18.148) 


Writing the first one of these as 


Ap Ak, Ok, 


Bp _ ey Brey 


we see that this is in the form of a conservation law; log(a/3) must be an 
additively conserved quantity for collisions. The other equations in (18.148) 
give the same conclusion. If the conserved quantity is taken as the energy, 


(18.149) 


pad pe Wp 
a = ep ( = ) (18.150) 
This leads to the equilibrium solution 
il 
Np = pa (18.151) 


The parameter T’ is now identified as the temperature. More generally, the 
conservation law would include momentum; this would lead to a term pro- 
portional to p in the exponent, which can be interpreted as an overall drift 
motion of the medium. 

Our derivation of the Boltzmann equation shows its limitations. It applies 
to dilute systems for which the truncation to one-particle distributions is 
adequate. Further, the potentials have to be slowly varying. In more general 
situations, one has to use Z[J, J] as in (18.122) or analyze the full set of 
equations (18.124). The derivation of the Boltzmann equation was primarily 
to show that those equations do describe kinetic theory. 


18.11 The imaginary time formalism 


We have constructed propagators for various fields at finite temperature and 
density using the expectation values for the occupation numbers given by 


‘ 
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the appropriate density matrix. If we are interested in equilibrium proper- 
ties, rather than time-dependent phenomena, there is another method which 
is very useful for certain types of calculations. This is the imaginary time 
method, which we shall briefly discuss now. For simplicity, we shall again 
consider a scalar-field theory with the action 


S= ‘i d*z |(00¥ - sig? — r»4 (18.152) 


The equilibrium solution for the density matrix in the rest frame of the 
medium is p = Z~!e~84. The partition function is given by Z = Tre~9". 
e~°# is in the form of the time-evolution operator e~*” for an imaginary 
interval of time —i@. Using the path integral formula developed earlier, 


(ale ly) = f Idellde'] vee %(—) K(e',4,,0) 
K(¢',t, 9,0) = (y/le“"" |p) 
t 
= [tae exp ( / dt d°x cit)) (18.153) 
0 
where Y,(y) = (ya) is the wave functional of the state |a) and the functional 
integral for K(y’,t,y, 0) is over all fields with the boundary values y at t = 0 


and y’ at t. For the partition function, we introduce the variable 7, 0 < rT < £, 
with t = —ir. This gives the result 


(ale~P#|y) = if [delldy"] U2(¢'¥(~) Kaly’, 9) 


6B 
Kaly',) = / [dy] exp ( | dr dx L(t = -in) 


ig 
= i [dp] exp (- i dr d’z cs] (18.154) 


Le = —L(t = —i7) 
= 5 (ry)? + (Vo)? + mg?) +2 ¢4 (18.155) 


where 


is the Euclidean Lagrangian for the theory. Since }>, Ya(y)Wa(y’) = 4(y, 9’), 
we see that, upon taking the trace, we must impose periodic boundary con- 
ditions for the fields y in the 7-direction. This leads to 


Z= [dy] exp (—Sz()) 


periodic 


Sp(B) = i ir ee : [(A,p)? + (Vo)? + m2g?] +r y4 (18.156) 
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One can also define correlators 


(co 71:20 meee eae / (dyo)ie 0) (reg) we eae) 


(18.157) 
We thus see that the equilibrium theory is defined by the Euclidean theory 
on R3 x §$!, with periodicity of fields along the r-direetion (corresponding to 
the S' component). 

It is instructive to evaluate the propagators which are needed for the 
imaginary-time formalism. The periodicity in the 7-direction implies that 
the frequencies are quantized as w, = 2mn/B = 2nnT; these are referred 
to as the Matsubara frequencies. The propagator, which is the inverse of 
(—Ok + m7), is thus given by 


dk exp [iwn(Te — Ty) + ik - (x — y)| 
Ga(z, y) =0 | aa of. pee (18.158) 


The summation over n can be carried out using complex integration tech- 
niques. The summations involved are of the form 


n2b2 ae a2 


F(r) =) f(n,7) = > —_ + __gintr (18.159) 


Notice that this function is invariant under br — br + 27m for any integer 
m. For positive 7, we can thus evaluate this in the interval 0 < br < 27 and 
define the value of the function for other positive values of br by periodicity. 
Consider now the function 


1 
Nt) =e (18.160) 


This has poles at z = n; expanding around the pole, we see that the residue 
is 1/277. We can therefore write the summation in (18.159) as 


il : 
Ri) f (a2 + a2) (eoriz 7) “*PCHbz7) (18.161) 


where the contour C is as shown in figure 18.1, enclosing the poles due to 


Fig 18.1. Contour C for integration in (18.161) 


‘ 


18.11 The imaginary time formalism 433 


g(z). For 7 > 0 and 67 < 2z, the integrand vanishes for large positive and 
negative imaginary values of z. Therefore we can deform the contours as 
shown in figure 18.2, with the large semicircles in the upper and lower half- 
planes, picking up the contributions due to the poles of 1/(b?z? + a). F(r) 
can be evaluated as 


7 Cau 


_ 1 
=| a (1+ sae) (18.162) 


Fig 18.2. Deformed contours for the evaluation of (18.161) 


This gives the value of the integral for 7 > 0. Using this result to carry out 
the summation in (18.158) and continuing to Minkowski space by tT = iz°, 


we get 


dk 1 ey Ae 
aston] = ee Jormye n(x°—y°) 


Tx ;Ty—riz ,iy® 
+n, eive eo] eik (x—-y) 


_f fe 
7 (27) QW 


[ + ng)eh-¥) 4 mee 


(18.163) 


Comparing this with the propagator (18.16) we see that the Minkowski con- 
tinuation of (18.158) is indeed the correct temperature-dependent propagator. 
Actually, we have shown this result only for tT; — Ty > 0. The summation can 
be carried out for T; — Ty < 0 using the formula 


434 18 Finite temperature and density 


272z 
— fo OE em 1 18.164 
an (oS + a*)(e2miz = 1) exp(i 2T) ( ) 


This ensures that we have convergence when we deform the contour to the 
two semicircles. The integral is then obtained as 


7 ae ye i ee 
PT) ae ser mi. (1+ P| “| (18.165) 


The continuation of Gg(z,y) using this result shows that the Minkowski 
propagator (18.16) is correctly obtained by analytic continuation of the Eu- 
clidean propagator with periodicity in T, for both tz —Ty < 0 and tT; —T, > 0. 

The summation in (18.159) is defined in terms of the function f(n,7) 
which is defined on the integers. In writing the formulae (18.161) and (18.164), 
we are doing an analytic continuation of this function to all values of z. 
Such a continuation is not unique and this leads to the possibility of using 
different integrands in evaluating the sum by contour integration. For 7 > 0 
and for 7 < 0, the appropriate continuation is given by the one which gives 
convergence of integration over the semicircles when we deform the contours. 
This is what we have done in (18.161) and (18.164). 

So far we have discussed bosonic fields. In the case of fermions, we should 
use antiperiodicity conditions 


(7 + 8) =—Y(7) (18.166) 


The Matsubara frequencies in this case would be of the form Ww, = 2a(n+ 
5)1. The Euclidean free fermion action is S = f d4z u(y- 0+ m)y and the 
fermion propagator is then given by 


eiP: (z—-y) 


S(a,y) = aa er (18.167) 


where p4 = Wn = 2n(n + a)T. By virtue of the summation formula 


> f(n+3, 7) =f fe (18.168) 


one can check that the Minkowski continuation of this propagator correctly 
reproduces the temperature-dependent propagator (18.22), (183 23). ae de- 
nominator on the right-hand side in (18.168) has poles at z = n + $; the 
structure of this term is thus dictated by the antiperiodicity of the Bele It 
then leads to the correct thermal distribution for fermions with 1/(e%/T +1). 
This is essentially why the antiperiodicity conditions (18.166) are the correct 
prescription for fermions. Notice that since observables involve bilinears of 
fermions, periodicity of observables is compatible with the antiperiodicity of 
fermions fields. 
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Combining our discussions of fermions and bosons, the functional integral 
for a general field theory at finite temperature is given by 


z= fo dulev.d.A..) exp |-Seled.8.4..) + f Jo+m+--| 
P/AP 


(18.169) 
where the subscript P/AP on the integral indicates that the integration is 
done over all fields with periodicity for boson fields and antiperiodicity for 
fermion fields in the imaginary time direction, the period being 6 = 1/T. By 
functionally differentiating Z with respect to various source functions up to 
the apprpopriate order and setting them to zero, we can obtain temperature- 
dependent correlation functions. For J = 7 = ... = 0, Z is the thermal 
partition function. In the case of gauge fields, there is a qualification which is 
useful to emphasize. The ghost fields are anticommuting, but they must still 
have periodicity under r — T+3. This is because the ghosts are, after all, just 
a way to rewrite the Faddeev-Popov determinant. The latter is for bosonic 
fields, specifically the gauge fields, and is computed with modes which have 
periodicity conditions. Since any rewriting must reproduce this result, the 
ghost fields should be periodic in r. 

Another interesting feature which emerges clearly in the imaginary time 
formalism is the dimensional reduction in the high-temperature limit. As T 
increases, the Matsubara frequencies become large, except for the n = 0 
mode for bosons. In the mode sums in the propagators, the magnitudes of 
the various terms become small, except for n = 0 mode of bosons. Thus, 
as T becomes very large, only the mode with zero Matsubara frequency is 
important. This mode is independent of the imaginary time 7. Thus, apart 
from factors of 3 arising from integrations, the theory reduces to a three- 
dimensional theory. The high-temperature limit is given by the bosonic theory 
in one lower dimension. This reasoning can be useful, but applies only when 
there are no infrared divergences. If there are such divergences, one has to 
take account of them by resummations or other techniques, before this kind 
of dimensional reduction can be used. 

From the real-time point of view, one may understand the dimensional 
reduction as follows. If a perturbation is introduced into the medium, thereby 
creating a nonequilibrium situation, it returns to equilibrium by collisions 
and other processes in a characteristic equilibration time. As one increases 
the temperature, the equilibration time decreases. Thus asymptotically for 
large temperatures, equilibration is so fast that all time-dependent processes 
can be ignored. Effectively, this leads to the dimensional reduction. 
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We have already seen that the mass of a scalar particle gets temperature- 
dependent corrections. In a theory with spontaneous symmetry breaking, 
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the temperature-dependent corrections can make it energetically favorable 
to have a symmetric ground state at high temperatures. Thus we can get the 
restoration of symmetry at high temperatures, with spontaneous symmetry 
breaking at low temperatures; the theory is realized in a symmmetric phase at 
high temperatures and in a broken symmetry phase at low temperatures with 
a phase transition at some critical temperature T,.. A simple model which can 
illustrate this is the O(N) model with N scalar fields ¢',i=1,2,..., Newith 
a Euclidean action 


Se= f atx | 0900! + (G6! - v2) 


2 / d*x [,as'ae" - 5 (4d? )og' + \(¢'¢")? + constant 


(18.170) 
To first order in A the mass correction is given by 
dm? = 4\(N + 2)Go(z, 2) 
me peu (18.171) 


3 


where Go(z, y) is the thermal propagator in the free theory, and in the sec- 
ond line, we have shown only the temperature-dependent correction. (The 
T-independent part is absorbed into the definition of mass by the renormal- 
ization procedure as usual.) Including this correction, the term in the effective 
action which is quadratic in ¢ is '@) = 4M?¢‘¢', where 


M? = —4)v? + 4d(N + 2)Go(z, 2) 


= —4) (? = or) (18.172) 


This shows that there is a phase transition at 


} 12 
T= N+2 v (18.173) 


with the symmetry broken phase with (¢*) # 0 being energetically favored 
at T < T, and the symmetric phase with (¢*) = 0 being favored at T > T,. 
This simple calculation indicates that there is a phase transition as we go 
from low temperatures to high temperatures, with (¢) 4 0 for T < T, and 
(¢) = 0 for T > T,. However, the first-order perturbation theory is inade- 
quate concerning the details of the transition. The higher-order corrections 
are important near the transition point. This is because the effective mass M 
goes to zero at the transition point and various diagrammatic contributions 
develop infrared divergences; they become singular as M — 0. As a result, 
we cannot justify neglecting them. One needs to do a resummation of terms 
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in the perturbative expansion. The Schwinger-Dyson equations provide one 
way to address this problem. 

In the imaginary time formalism, the equations of motion and the func- 
tional integrals retain the same form as they have at zero temperature; the 
only difference is that they are defined on R° x $!, rather than R4, reflecting 
the periodicity of fields in the imaginary time direction. The Schwinger-Dyson 
equations may then be written as 


/ d4zq [K(1,2) + E(1,2)]G(2,3) = 6 (a, — 28) (18.174) 


where K = (——1]— 4)v”) and 
(1,2) = 4A(N + 2)G(1, 1)6 (ay — 2) +++ (18.175) 


G(1, 2) is now the full propagator. As indicated after (8.108), the higher terms 
in X' start off as ~ \?G(1, 2)? + .... For a constant effective mass, equations 
(18.174), (18.175) simplify as 


hd 1 


Gaon: TA (18.176) 


M? = —4)v? + an +2) fo 


with w? = k? + M?. This is in the form of a gap equation (for the energy gap 
M). The solution of this equation for M as a function of T will give a better 
approximation to the behavior near T,. Let 4p = M/T and define 


6 / e nie 1 
2 a a 18.177 
= 78 Vx? + ui? exp(s/2? + y?) -1 
This is the same integral as (18.18), except for w replacing a = m/T; the 
prefactor in (18.177) is such that Z(0) = 1. The gap equation can now be 


written as 
gg 2) +3 (2 ee i) (18.178) 


Notice that it is consistent for to go to zero as we approach T, from above. 
Thus the transition temperature is still given by equation (18.173), up to this 
order of resummation. Our calculation is valid for approach from above the 
transition point; below T., we have symmetry breaking and the calculation 
has to be done in terms of the Goldstone bosons and the one massive scalar 
field. 

The terms which are resummed by the Schwinger-Dyson equations when 
5 is approximated by 4A(N + 2)G(1, 1)5“4) (x1 — x2) are shown in figure 18.3. 
The diagrams in the first line are often referred to as the “daisy” diagrams 
and the second as the “superdaisy” diagrams. 
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Fig 18.3. Daisy and superdaisy diagrams 


When further corrections such as the \*G3(1,2)-term are added to Z, 
they generate new sequences of diagrams in addition to those shown here. 
The various types of higher-order contributions in perturbation theory can 
be classified by the powers of N they generate in the large N-limit. The term 


we have considered so far in »’, namely, (18.175), is of order AN. The next 
term in »’ is given by 


_ 32(3N +6) 


5) (1,2) = 5 


d? G3(1, 2) (18.179) 
This term is of order \?N. We define a partial resummation of the perturba- 
tion theory by reorganizing the terms in the perturbative expansion in inverse 
powers of N, by taking \ small, with AN fixed, as N becomes large. The for- 
mula (18.175) for X’, and the “daisy” and “superdaisy” diagrams it generates, 
are of the leading order in this expansion. The next correction (18.179) is of 
lower order, of order 1/N. In the large N-expansion, the resummation with 
(18.175) is exact and the gap equation shows that the effective mass van- 
ishes at the transition point. Indeed, by taking the derivative of Z() and 
evaluating it for small w and integrating, we find 


3 
Ty) 1- te (18.180) 
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The gap equation (18.178) can be solved near T, as 


2 
pe = S) (18.181) 


showing that M vanishes linearly as the transition temperature is approached 
from above. Because the effective mass is zero, one gets massless excitations 
(long-wavelength excitations) at the transition; therefore, the transition is at 
least of second order from the thermodynamic point of view. 

The characteristics of the transition for finite N are not clear from the 
calculation we have done, since the contribution (18.179) to 2’, and the fur- 
ther higher corrections which are possible, cannot be taken as being of higher 
order; they can have a nonnegligible effect. A more involved nonperturbative 
technique will be needed. 


18.13 Symmetry restoration in the standard model 


In the standard model, we have the spontaneous breaking of the symmetry 
via the Higgs mechanism which gives masses to the W+ and Z particles. At 
high temperatures, this symmetry can be restored in a manner analogous to 
what happens for the O(.V) model. In this case there are gauge particles and 
fermions in addition to scalars and the details of the transition are somewhat 
different. We can discuss this effect by an effective potential calculation, as 
we did for the zero temperature case. For an arbitrary (but constant) back- 
ground value of the Higgs field ¢, the gauge particles have masses, with 
M2 = 4(g? + g'2)¢'d, Mf, = 49?¢'¢. Therefore their contribution to the 
free energy changes with ¢ and this energy is the effective potential. In the 
imaginary time description, for a single scalar particle, we have 


3) = 5 Trlog(k? +s) 


3 
=; | a ash 


Here s is eventually the square of the mass for the field with the Higgs 
background. Taking the derivative with respect to s, we get 


1 
Se 
al an / om JERE lai (m +5) (18.183) 


~ exp = WEES) Ta 


Integrating with respect to s, we get 


(18.182) 
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I'(m?) =f the l= (ans E we + Tlog(1—e7 bios) + constant (18.184) 


where the “constant” denotes terms which are independent of m? and w, = 
Vk-k+m?. The first term on the right-hand side is the zero point energy 
and leads to the one-loop effective potential as in Chapter 8. It has to be 
evaluated with a cut-off and will require renormalization of the parameters in 
the Lagrangian. Here we are interested in the temperature-dependent second 
term, which we denote by Ir. From (18.183) we can write 


“i 2 on VEER” 
oy af d*xT(B/s) (18.185) 


Approximating Z(3,/s) for small values of G,/s as in (18.180), we get 


Ten? Tm? 
fp ———————— ose 18.1 
T = constant + A on + (18.186) 
This is the contribution for each polarization state of each boson. For the 
contribution of the W and Z bosons, we thus find 


i a 
Ty = constant + TOM} +6M3) - Jog Mz + 6Miv) + +++ (18.187) 


With the values of Mz and My given earlier, the full effective potential, to 
this order of calculation, is 


Pr = (gtd)? - a(T) dtd — o(T)(6te)? ++ 


ere Rees 2\ § 
+ 
(4) +6(2) | (18.188) 


a(T) changes sign at some temperature T, defined by a(T.) = 0, with a > 0 
for T < T, and a < 0 for T > T,. The cubic term ¥(otd)2 is very important 
in determining the nature of the transition. Ignoring the constant part which 
is irrelevant for this discussion, we see that I = 0 at ¢ = 0. As ¢ increases, for 
small ¢, ad! ¢ is the leading term and we see that I’ decreases. As ¢ increases 
further, the 7(¢'¢)2-term decreases I further. Eventually the \(¢'¢)?-term 
dominates, making I" increase. Thus for a > 0, the minimum is at a nonzero 


value of @ given by 
Be v 32Aa + 9y? + 3y 
8A 


127 


(18.189) 
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This gives symmetry breaking at low temperatures. When a = 0, I still de- 
creases as we increase ¢ starting from zero. There is still a nonzero minimum 
and we still have symmetry breaking. When a < 0, I’ initially increases with 
¢, then comes down due to the y(¢'¢) 3-term and then increases again due to 
the \(¢'¢)?-term. There are thus two minima, ¢ = 0 and ¢ = (¢), given by 
(18.189). For small |a|, (¢) remains the global minimum. For large |a|, ¢ = 0 
is the global minimum. There is a potential barrier between the two minima, 
as shown in the figure below. Any transition from ¢ = (¢) to the minimum 
at ¢ = 0 has to proceed by quantum tunneling because of this barrier. 


a<0 


(arcs (0) 


[| 


a>0O 


Fig 18.4. The effective potential for different values of a 


Suppose we start from low temperatures in the phase with the broken 
symmetry. As the system heats up, eventually ¢ = 0 becomes the global 
minimum. But the system remains in the local minimum at (¢), with the 
broken symmetry. This is a metastable state. Eventually there is a tunneling 
transition to ¢ = 0 and the symmetric phase. The barrier becomes smaller 
and the tunneling is easier if the temperature is further raised. What we have 
described is typical of a first-order transition. The energy difference between 
the local metastable minimum at (¢) and the global minimum at ¢ = 0 is the 
latent heat of the transition. Thus in the one-loop approximation we have 
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used, we find that the standard model undergoes a first-order transition to 
the symmetric phase if the temperature is raised to a sufficiently high value. 

We have included the contribution of the gauge bosons to the effective 
potential. A more comprehensive analysis, even within the one-loop approx- 
imation, will require the contribution of the Higgs particle and the fermions. 
Further, close to the transition point, one has to include higher order cor- 
rections as in the O(N) model to obtain a satisfactory description of the 
dynamics of the transition. 
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19 Gauge theory: Nonperturbative questions 


19.1 Confinement and dual superconductivity 


19.1.1 The general picture of confinement 


We have seen in Chapter 10 some of the perturbative aspects of Yang-Mills 
theories. Since the physical realization of gauge theories is the standard 
model, most of the perturbative calculations which have been done are in 
this context. Various processes, and radiative corrections to them, have been 
calculated and checked against experimental data for the electroweak the- 
ory. The SU(3) gauge theory of strong interactions has an unbroken gauge 
symmetry and our discussion of the effective charge and asympotic freedom 
shows that perturbative calculations are reliable at high energies. A large 
number of processes have been analyzed at high energies and compared with 
experimental data. The basic conclusion of all these calculations is that the 
standard model is in very good agreement with experiments for all processes 
in the kinematic regimes where reliable perturbative calculations can be done. 
However, there are many situations for which the perturbative analysis is not 
adequate; the most notable and physically relevant case is the low-energy 
limit of quantum chromodynamics (QCD). By now there is some understand- 
ing of the qualitative features of the theory in the nonperturbative regime, 
although most of the quantitative questions have not been answered. In this 
chapter we will discuss some of the approaches to nonperturbative issues. 
In an unbroken Yang-Mills theory with a small number of fermion or 
scalar matter fields, we still have the property of asymptotic freedom. This 
shows that the effective coupling constant of the theory increases as we go 
to processes of lower and lower momenta. The A-parameter defined by the 
formula for the running coupling constant defines the basic scale of the the- 
ory; fields of momenta much larger than A can be treated perturbatively, but 
all modes of momenta comparable or less than A have to treated nonpertur- 
batively. The perturbative formula for asymptotic freedom is no longer valid 
near A, so we can only conclude that the tendency of the theory is to move 
to stronger effective couplings at lower momenta. The general expectation 
is that this will lead to interparticle potential energies which increase with 
distance and hence to the confinement of all particles belonging to all rep- 
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resentations of the group except the identity representation which has zero 
charge. This confinement hypothesis may be stated as follows: 


The asymptotic states of a nonabelian Yang-Mills theory with 
group G are all singlets (identity representation) of G. In particular, 
in QCD, quarks do not appear as asymptotic states; they can only be 
constituents of color singlet combinations and cannot be separated 
arbitrarily far off from the rest of the constituents. 


Recall that the transformation of the gauge potential is given by A — A9 = 
gAg—! — dgg~', where g(x) : R? — G, with g equal to a constant at spa- 
tial infinity. Among these transformations, those which go to the identity at 
spatial infinity are the true gauge transformations; the set of such transfor- 
mations were denoted by G, and they act as identity on the physical states. 
The transformations which go to a constant g, not necessarily equal to 1, form 
the set G. We argued earlier that G/G, = G, the Lie group in which g takes 
values. The transformations which go to a constant, not necessarily equal 
to the identity, are equivalent to a G-transformation; they act as a Noether 
symmetry and the states can be classified by representations of G. Nontrivial 
representations correspond to the charged states of the theory. Thus, at this 
kinematic level, there is nothing against charged states. Confinement is thus 
of a dynamical origin. 

If quarks are confined, the obvious next question is about the interquark 
potential. Studies of bound states of heavy quarks such as charmonium and 
the upsilon show that the interquark potential, at least for these heavy quarks, 
is linearly increasing with the separation, V(r) = or. Such a linearly increas- 
ing potential shows that the chromoelectric flux between the quarks is drawn 
out into a thin flux tube of some constant nonzero tension, so that the en- 
ergy of the tube, which is the energy of the separated quark-antiquark pair, 
is proportional to the length of the tube, which is r. A chromoelectric string 
connects the quark-antiquark pair. One may now ask how such a flux configu- 
ration can be physically realized. In the theory of superconductivity, one has 
the Meissner effect, which leads to exclusion of magnetic fields from super- 
conducting regions. In type II superconductors, where the coherence length 
is smaller than the London penetration depth for magnetic fields, it is possi- 
ble for magnetic fields to penetrate into a superconductor without destroying 
the superconductivity completely. In this case, the magnetic field is squeezed 
by the superconductor into as narrow a cross section as possible (within the 
limits imposed by energy minimization), and the result is a long tube of mag- 
netic flux. These are the Abrikosov vortices, which have a constant energy 
per unit length. The total flux in such a vortex has to be quantized and is 
of the appropriate value as to connect a monopole-antimonopole pair. The 
interaction energy of a monopole-antimonopole pair, if such a pair were phys- 
ically possible, immersed in a type II superconductor is thus proportional to 
the separation of the two particles. The situation is very similar to what 
happens with quark-antiquark pairs except that in the latter case the charge 
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is of the electric kind rather than the magnetic kind. If we consider a mag- 
netic superconductor due to a vacuum condensate of magnetic monopoles, 
we would have squeezing of electric flux lines and the formation of electric 
flux tubes between elctrical charges. Thus quark confinement with a linear 
potential for heavy charges can be understood if we take the vacuum state 
of QCD (or any unbroken Yang-Mills theory with a small number of matter 
fields) as a chromomagnetic superconductor formed due to the condensation 
of chromomagnetic charges. 

In an ordinary superconductor, one can define a field which behaves as 
an order parameter. This is the effective field ¢, representing the Cooper 
pairs, or the paired electrons. ¢ has a charge 2e, couples to the electromag- 
netic field, and has the ground-state expectation values (¢) # 0 for T < Ti, 
corresponding to the superconducting state, and (¢) = 0 for T > Ty, corre- 
sponding to the normal state. Thus, the phenomenon of superconductivity 
is the Higgs phenomenon of spontaneous symmetry breaking in the Landau- 
Ginzburg model which is the (nonrelativistic) Abelian Higgs model. In the 
context of Yang-Mills theory, we have to address the question of how to go 
to the dual picture of chromomagnetic, rather than chromoelectric, super- 
conductivity, whether we can find an order parameter field, how we can get 
vortices, how the vacuum state can be characterized, etc. We will begin by 
relating the interquark potential for heavy particles to the Wilson loop oper- 
ator. 


19.1.2 The area law for the Wilson loop 


We start with the definition of the parallel transport operator 
x ab 
U(x, y, A) = [Pexp (- f A,ac")) (19.1) 
yC 
Under a gauge transformation A — A9, this transforms as 


U(x, y, A%) = a) Pexp (- ie A,as"’) mw| ab 


= [9(2)U(2,y, A)g*(y)]" 


As discussed in Chapter 10, the Wilson loop operator is given by 


(19.2) 


Wa(C) =Tr [Pexp (- f . Ayds!)| = TrU(a,2,4,C) (19.3) 


where the integral is over a closed curve C starting and ending at x. The trans- 
formation law (19.2) shows that Wr(C) is gauge-invariant. The subscript R 
refers to the fact that A, = —it*A%, are taken to be in a representation F of 
the group G. 
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In order to relate this to the energy of a particle-antiparticle pair, we will 
start by considering the Euclidean spacetime process where a heavy static 
particle-antiparticle pair is created at a certain time x°, propagates to time 
x° + T, and then is annihilated. The gauge-invariant operator which creates 
a particle-antiparticle pair separated by a spatial distance r is given by 


Ft(2®,21,a° +r) = pl? (2)U(2; yx? (y) (19.4) 


Here x = (x°, x), y = (x°, x! +r). For simplicity we take the separation of the 
pair to be along the x!-direction. ¢' and x! are creation operators for heavy, 
static particle and antiparticle, respectively. Under a gauge transformation, 
¢ — g¢, X > g*x, , so that Ft is gauge-invariant. For heavy particles with 
the mass tending to infinity, the action is given by 


Stsae / dn Teg! Doo + iy Dox] (19.8) 


where Dod = 0:6 + Aod and Dox = O% + Ajx are the covariant derivatives 
of ¢@ and yx, respectively. 

Let H be the Hamiltonian corresponding to the Yang-Mills theory with 
these matter fields ¢, x in the Ap = 0 gauge. The Ap-dependent terms in 
(19.5) are then zero but will contribute to H via the Gauss law. We can then 
write 

(O|F e 2? Ft0) & NM e POT (19.6) 


where NV is some prefactor related to the normalization of F’, and E(r) is the 
energy of the pair. We will consider T to be large. When T becomes large, we 
get the energy of the lowest energy state which can be created by F’. Since 
the particles are heavy and static, E(r) is just the interaction energy of the 
pair due to the gauge field. We may rewrite (19.6) as a Euclidean functional 
integral 


(OF e~#T Ft\0) = / du(A, b,x) exp [-Se(A,6,x)] 
x*(y/ U4 (y’, 2')b4(a') g!2(x)U? (x, y)xt?(y) 


(19.7) 
where x! = (x° + T,z'), y! = (2° + T,2! +r). The Euclidean action for ¢, x 
is given by 
a) 6) 
Sxz(¢,x) = jas leat e + xt | (19.8) 


with the propagators 


(°(x)d™(a")) = 6%0(7 — 7')6 (x — y) 
(x*(a)xP(a')) = 69O(r — 7’ )5O@—y) (19.9) 


The functional integral (19.7) then reduces to 
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(OF e~PTtY0) = f du eS” U(y/, 2") U%(2,y) 
= (h du e~S= Wr(C) (19.10) 


where C’ is the rectangle with vertices z,y,zx’,y’. Combining (19.6) and 
(19.10), we see that 

(Wr(C)) & Ne BOT (19.11) 
Thus, evaluating the Euclidean expectation value of a large Wilson loop, we 
get the interaction energy of a heavy static particle-antiparticle pair. Even 
though we showed this in Ag = 0 gauge, Wr(C) is gauge-invariant and so are 
energies of gauge-invariant states, so that the argument holds true in general. 
For a heavy quark-antiquark pair, E(r) = or, and hence we may encode this 
piece of information as the statement 


(Wr(C)) = N exp(—o r T) 
m= N exp(—a A(C) ) (19.12) 
where A(C) is the area of the minimal surface whose boundary is C. 
In order to relate this “area law” for the Wilson loop to the picture of dual 
superconductivity, we need to discuss vortices in an ordinary superconductor, 


show that this leads to a linear potential between monopoles, and then go to 
a dual picture. So we now turn to vortices. 


19.1.3 Topological vortices 


The nonrelativistic version of the Abelian Higgs model is the Landau- 
Ginzburg theory of superconductivity. Vortex excitations in this model were 
first analyzed by Abrikosov. We will consider the relativistic model where the 
pioneering work on vortices is due to Nielsen and Olesen. 

The action for the Abelian Higgs model, given in (12.119), is 


s= fate [- Phar +(D,8)"(D"8) - V0) 
V(¢"¢) = A(o*¢ — v?/2)? (19.13) 
The Hamiltonian corresponding to (19.13) is 
H= / dx E (E? + B?) + $°$ + (Did)* (Did) + x) (19.14) 


The minimization of H as a functional of the fields and derivatives is obtained 
by 
V(¢*¢) = 0, B= B=) (19.15) 
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¢@ = vV2 is a solution to these conditions. The classical vacuum is thus given 
by ¢ = v/V2. Building up a vacuum state with this expectation value, we 
get the standard Higgs mechanism as discussed in Chapter 12. 

We now consider static classical solutions of finite nonzero energy. For 
finite energy, the integrand in (19.14) has to vanish at spatial infinity so that 
we can have convergence of the integral. This requires that the fields go to 
a solution of the conditions (19.15) at spatial infinity. Since V has to be 
zero, the field ¢ at spatial infinity is of the form @ = (v/V2 V2) )exp(i6). This 
gives a mapping e”® : S? — U(1), from S? which is spatial infinity to U(1). 
The homotopy classes of such mappings are given by [J2(U(1)) = 0, so that 
the field space has only one connected component and minimization leads to 
the vacuum again. However, we will get nontrivial topological sectors if we 
consider vortices which have finite energy per unit length. The simplest case 
is to take a straightline vortex, which we take to be along the third axis. The 
energy per unit length of static configurations is given by 


c= fae E B? + (Did)"(Did) + V(6" 6) (19.16) 
For this to be finite we need ¢ > — _— as r? = x? + 22 — 00, giving 


eee rt) (19.17) 


The homotopy classes of such maps are given by J7;(U(1)) = Z. There are 
winding numbers for these maps given by 


Sift a 
oa) Pe a ts - 
at =¢ odo. 19.18 


The field space for configurations of finite energy per unit length has an 
infinity of connected components labeled by Q. In the simplest nontrivial 
case of Q = 1, we need ¢ — (x + izg)/r as r — oo. In a more general 
context, if we have a gauge symmetry G which is spontaneously broken to 
H, then the relevant homotopy group for vortices is IJ,(G/H). 

For finiteness of energy, we also need the term (D;¢)*(D;¢) to vanish as 
x} + x3 — oo. This can be achieved by choosing an ansatz for A; such that 
0; is canceled by —ieA;¢. In this case, we can write 


€ 


an |z|—00 
27 
/ er F = —Q (19.19) 


Thus a configuration with winding number Q may be interpreted as Q flux 
tubes, each carrying magnetic flux 27/e. Notice that this elementary flux is 
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exactly what is needed to connect a Dirac monopole, which has magnetic 
charge 1/2e, to a Dirac antimonopole. The quantization of the magnetic flux 
carried by the vortex is a consequence of the topological origin of the vortex 
as being due to nontrivial I7,[U(1)]. 
For the case of Q = 1, a suitable ansatz is given by 
o= a h(r) e%, eA; = = 


The equations of motion for static fields then become 


eee ree) 
dé\édéJ) 


1d (ott) _A(f-1)? M2 


a Ye 


where € = evr. There are two relevant scales for the analysis of vortices; these 
are given by the mass of the vector particles My = ev and the mass of the 
Higgs scalars M7 = V2 v. The asymptotic behavior of the functions f, h 
which leads to finite energy per unit length, is 


6) (19.20) 


a? == 0 (19.21) 


f + 1+a exp(-—€) 
h—-1-—-£ exp(—Mur) (19.22) 


Type I and type IT superconductors correspond to 


—— <l Type I 
>1 Type II (19.23) 


For type I, the Higgs field falls off more slowly than the gauge field; as a 
result the flux is contained in a region where the Higgs field is close to zero. 
For type II, the gauge field falls off more slowly, giving a flux tube in the 
superconductor. While an exact analytical solution for the equations (19.21) 
is not known, many of the essential features follow from approximate analysis. 
Equations (19.21) are the variational equations for the energy functional 


E = 2nv? fea [5 (G+ hi? + mr) + Aue ay] (19.24) 


Using inequalities on this functional (or the more general form (19.16) ) one 
can show that vortices in a type I superconductor attract each other, making 
it favorable for them to coalesce, while in a type II superconductor they repel 
each other, leading to well-defined multivortex configurations. 

For simplicity, we have so far considered a vortex along the x-axis. One 
can consider fluctuations of this vortex corresponding to bending of the vor- 
tex. One can in general get a vortex along some curve C’. As noted before, 
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vortices have to end in monopoles since the flux has to terminate in magnetic 
charges. But it is also possible for vortices to form closed loops, since there 
is no difficulty with flux conservation in such a situation. 

We will now consider very thin vortices which can be obtained when My 
and My become very large, with My >> My. We want to introduce an 
operator which can create such a vortex. Let C’ denote the vortex as a curve 
in three-dimensional space with coordinates z*(r). The magnetic flux for this 
vortex is given by 


dz* (3) 1 A : 
F=4ng | dr ——6°’(@—2(r)) =Gj4dr° A dx? 
Cc dr 2 
=F, ; (19.25) 


where g = on is the strength of the vortex. It is easily checked that the 


integral of F over a surface intersecting the vortex transversally is 27/e as 
required by (19.19). Let a be a one-form such that da gives the right-hand 
side of this equation. In other words, it is a specific potential which gives the 
magnetic field of a vortex. Notice that F' is zero everywhere except on a line 
C', which is the vortex. Thus a is a closed one-form on all of R* — {C’}. 

We will now construct an operator which creates a vortex. In the Ap = 0 
gauge, the action for a U(1) gauge theory is 


S= i d‘r 5 (0>A:apA.) _ 5| (19.26) 
The equal-time commutation rules are thus 


[A:(z), E;(y)] = 1 6 (x — y) (19.27) 


In the full Hamiltonian anlysis, we have to impose the Gauss law as well. In 
our discussion of the electromagnetic field, we solved the Gauss law explicitly. 
An alternative is to require that the physical states are invariant under gauge 
transformations. This is mathematically the statement 


(V+ E—Jp) |b) =0 (19.28) 


for all physical states |y). It is convenient to use this approach in constructing 
the vortex creation operator. 
The operator which can create the vortex line in (19.25) is then given by 


T(C’) = exp (-: / dx aE) (19.29) 
This gives 
T~*(C’) Ai(x) T(C’) = Ai(z) + a; (zx) (19.30) 


where we have used the canonical commutation rules. Magnetic flux across a 
surface ' can be measured by $, A, where 0X' = C. Using (19.30) 
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T-(C") if Al T(C') = f (A¥a) 
= £ A+ ik F, (19.31) 


We see that the magnetic flux is shifted by the flux of a vortex by the operator 
T(C’) confirming its interpretation as a vortex creation operator. If the vortex 
line does not intersect with the surface 2’, then the integral [ x Fy is zero; if 
the vortex line intersects with the surface more than once, we get an identical 
contribution for each intersection. Thus, 


| Pea (19.32) 
Dy 


where L(C,C’) is the number of times the curve C links the curve C’. It 
is called the Gauss linking number. This can have a topologically invariant 
meaning only if C’ is infinitely long or if it is closed. We will consider C’’s 
which are closed. The Gauss linking number of two closed curves C’ and C’ 
may be written as 


1 pa)?» 
MCC’) = cr ¢ ¢ | Cijk = us dy’ dax* (19.33) 


Equation (19.31) may now be rewritten as 
| f Al T(C’) =T(C’) | Al + T(C’) 4ng L(C,C’) (19.34) 
In the Abelian case, the Wilson loop operator becomes 
WNC) = exp E a Al (19.35) 
Consider now the identity 


exp [ie f | E;(r) exp ie f | = E;(x) + ef 50 (x — 2(r)) = dr 
(19.36) 


This shows that we may also think of the Wilson loop operator as creat- 
ing a thin electric flux tube (or electric string) along the curve C. A dual 
interpretation for the two operators T(C’) and W(C) emerges from these 
considerations. One may think of T(C’) as creating a magnetic flux tube, 
with to A measuring the magnetic flux through C. W(C) can be thought 
of as creating an electric flux tube, with the electric flux being measured by 
T(C’). The commutation rules may be written, in exponentiated form, as 


W(C) T(C’) = exp (i4meg) T(C’) W(C) 
= T(C’) W(C) (19.37) 
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The additional phase factor gives 1 because of the quantization of the flux 
carried by the vortex, eg = n/2 for some integer n. Therefore, this commu- 
tation rule may be taken as another way of stating the Dirac quantization 
rule. 

The operators T(C’) and W(C) provide a way of specifying the phase 
of the gauge theory without relying on the details of the mechanism which 
produces the superconducting phases. If we introduce a dual potential A for 
the electric field so that Ej = €:j~0; Ar, then we can write 


T(C') = exp (-itrg ¢ A) | (19.38) 


Even though there are some restrictions on when such a dual potential can 
be introduced, this equation shows that we may indeed think of T(C’) as the 
dual of W(C) under an electric-magnetic duality transformation. We have 
seen that confinement of electrically charged particles can be stated as the 
area law for the Wilson loop. Confinement of magnetically charged parti- 
cles occurs in the usual electric superconductor with a vortex connecting 
monopole-antimonopole pairs. The energy of such a configuration increases 
linearly with the separation of the particles since the energy of a vortex is 
proportional to its length. Since T(C’) creates a magnetic flux tube, by argu- 
ments similar to what we used for the Wilson loop, the linear potential for the 
monopole-antimonopole pair will give an area law for the expectation value 
of T(C’). Going back to W(C), we then see that the area law for its expec- 
tation value can be interpreted as being due to magnetic superconductivity. 
To summarize 


(T(C")) ~ exp(—a A(C’) ), electric superconductivity, 
magnetic confinement (19.39) 
(W(C)) ~ exp(—aA(C) ), magnetic superconductivity, 


electric confinement 


The advantage of defining the phases in this way is that it is a gauge-invariant 
specification purely in terms of operators constructed from gauge fields; it 
does not involve other fields such as the Cooper pair field, whose nonabelian 
analog is not easy to construct. Further, duality is very transparent. To ex- 
tend this characterization to the nonabelian case, we need to consider the 
nonabelian analogs of the operators T(C’) and W(C). 

The operator T(C") was introduced in the context of the characterization 
of phases of a gauge theory by ’t Hooft and it will therefore be referred to as 
the ’t Hooft loop operator. 


19.1.4 The nonabelian dual superconductivity 


As a first step toward the generalization of the ideas of the last subsection 
to the nonabelian Yang-Mills theory, we will consider the construction of 
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operators T(C’) in a U(1) subgroup of an SU(N) gauge theory. The SU(N) 
group has nontrivial central elements, namely, there are some elements of 
SU(N) which commute with all the elements of SU(N). The central elements 
of SU(N) form the cyclic group Zy whose elements can be represented as 
the set of (N x N)-matrices {1, w, w?,---,w-1} with w = exp (2mi/N). As 
a matrix, all these are proportional to the identity and evidently commute 
with all of SU(N). An explicit construction is obtained as follows. Let ta, 
a = 1,2,---,(N* — 1), be a basis of the Lie algebra of SU(N), realized as 
(N x N)-matrices. As usual the normalization condition is Trtgt, = 5 Sab: 
Consider 


ty2_4 = 


= Van Y (19.40) 


The group element 
N-1 
exp (20 1B Dew] = exp(2miY) = e27/N = y (19.41) 


This corresponds to choosing the group parameter corresponding to ty2_4 
as O@y2_, = 27,/2(N — 1)/N. Fields in the adjoint representation transform 
as d — ¢' = g¢g!, ¢ = ota, g € SU(N). These are invariant under Zy. 
All representations obtained from reduction of tensor products of the adjoint 
representation are also invariant. For all such fields, including the gauge field, 
which is in the adjoint representation, the group is actually SU(N)/Zwn. 
In particular, all components of all fields which are Zy invariant will have 
integer eigenvalues for Y. Matter fields in the fundamental representation are 
not Zy invariant since they transform as ¢’ = g ¢; for them, Y has fractional 
eigenvalues. 

The group SU(N) is simply connected, but if all fields are Zy invariant, 
the relevant group is SU(N)/Zy, and this is not simply connected. In fact, 
II,[SU(N)/Zyn] = Zw and hence if we consider a theory where all fields are 
Zn invariant, the spontaneous breaking of the symmetry completely will lead 
to Zn vortices. We want to analyze these vortices in some more detail. 

We will start with just the U(1) subgroup of SU(N) defined by the gen- 
erator ty2_; or Y. The covariant derivatives are of the form 


Dud = O,6 — ieV Aud (19.42) 


(We use Y, absorbing the normalization factor ,/N/2(N — 1) into the field 
A,,.) One can break this U(1) group by Higgs fields which are in some rep- 
resentation of SU(N)/Zy. For example, there are many components of a 
Higgs field in the adjoint representation which carry nonzero eigenvalues of 
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Y and so transform nontrivially under this U(1); they can be used to break 
the symmetry. If this U(1) is broken, we get vortices as discussed before. 
Since the eigenvalues of Y are integers, the magnetic flux of the vortices will 
be quantized in units of 27/e. One can then construct the operator T(C’) as 
before. Consider now the Wilson loop operator defined in (19.3). Unlike the 
Abelian case, we cannot use ¢, A as a gauge-invariant operator. Only the ex- 
ponentiated form, with the path ordering of the Lie algebra matrices and the 
trace, is invariant. Therefore we must ask whether the flux of a vortex can be 
detected by measuring it with W(C). For Wilson loops in the fundamental 
representation, the values of Y are 1/N or —1+ (1/N). In this case, we see 
immediately that 


Wr(C) T(C') = ex 


2miL(C,C’) 
P N 


) T(C’) We(C) (19.43) 


where T(C’) creates the basic vortex of flux 27/e. The basic magnetic vortices 
can be detected by use of W(C) in the fundamental representation. There are 
no dynamical charges in the fundamental representation, so what is meant 
is that by use of heavy external (nondynamical) charges which are in the 
fundamental representation, we can detect the basic vortices, of fluxes k 27/e, 
k = 1,2,...,(N — 1), or more generally fluxes modulo N, in units of 27/e. 

At the level of just the U(1) field, there are also vortices with fluxes 
which are N times the basic unit of 27/e, or multiples thereof. An operator 
T(C’) for them will commute with Wr(C), and hence with W(C)’s in any 
representation. They cannot be detected. In fact they have no meaning, once 
we embed the chosen U(1) subgroup in the full SU(N); the only meaningful 
flux values are modulo N since 17,;(SU(N)/Zn] = Zn. 

In the nonabelian theory with Zy invariance, since the only relevant fluxes 
are Zy-valued, we can define the operator T(C’) by the commutation rules 
(19.43). Notice that if T(C’) is defined by (19.43), then effectively the oper- 
ators T(C’) and W(C) are expressed entirely in terms of gauge fields and do 
not depend on ideas of Higgs fields and spontaneous symmetry breaking which 
we used to arrive at it. The commutation rule (19.43) as the general algebra 
for the electric and magnetic vortex creation operators was first obtained by 
’t Hooft and hence is often called the ’t Hooft algebra. The interpretation of 
Wr(C) as an operator creating a chromoelectric string (with chromoelectric 
flux value appropriate for connecting a heavy particle pair in the fundamen- 
tal representation) and measuring chromomagnetic flux is still valid. T(C’) 
creates a chromomagnetic string and measures chromoelectric flux. We can 
now use these operators to characterize a notion of duality and phases for the 
gauge theory. We have already seen that confinement of heavy quarks via a 
linear potential is characterized by an area law for the expectation value of 
the Wilson loop. This may be taken as the definition of the phase of chromo- 
magnetic superconductivity. The dual phase would correspond to an area law 
for T(C’) which may be taken as a phase of chromoelectric superconductivity. 
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The Wilson loop Wr(C) thus serves as an order parameter if all the 
dynamical fields are Zy invariant. This analysis does not in any way prove 
or even provide a logical argument for confinement. All that we have done 
is to give a qualitative picture which singles out area law for the vacuum 
expectation value of Wr(C) as the result to be proved in order to demonstrate 
mathematically the confinement of heavy charged states which are not Zn 
invariant. It would show that the vacuum state is a state of chromomagnetic 
superconductivity. 

We now turn to what happens in a theory with matter fields in the funda- 
mental representation or other representations which are not Zy invariant. 
This is clearly the case of physical interest since chromodynamics has quarks 
in the fundamental representation. In this case, the relevant group becomes 
SU(N), which is simply connected, and there are no vortices. We can no 
longer characterize phases of this theory in terms of order parameters W(C) 
and T(C’). This is not to say that there is no notion of confinement; it can 
be defined in terms of all the asymptotic states being singlets, but we can 
no longer use (W(C)) as a probe or signal of confinement. One can also see 
more explicitly why (Wr(C)) will not work when there are Zy noninvariant 
fields such as quarks. In this case, there is some probability of creating a 
quark-antiquark pair if sufficient energy is available. Thus it is possible for 
the electric string connecting the heavy external charges to break, the flux at 
the broken ends getting attached to the quark and antiquark. Such a process 
is consistent with Gauss law. Effectively, the external charges are screened by 
the newly created particles which are attached to them by short strings. The 
operator F' of equation (19.4) creates a very unstable state if the separation 
is large; the state of external charges connected by a long string loses identity 
as it acquires a large decay width. 


19.2 ’*t Hooft-Polyakov magnetic monopoles 


We have argued that the condensation of monopoles forming a state of dual 
superconductivity can lead to the confinement of charged particles for a non- 
abelian gauge theory. The monopoles themselves were not directly used in 
this argument. Nonabelian theories which are not spontaneously broken ad- 
mit configurations of magnetic charge and this is crucial for the dual super- 
conductor picture. However these monopoles do not appear as stable config- 
urations in the theory. This is why they were not explicitly used in all the 
arguments. 

In a spontaneously broken gauge theory monopoles can appear as stable 
classical solutions of finite energy and these provide a way to understand 
many of their properties in general. These solutions, known as ’t Hooft- 
Polyakov monopoles, retain their identity under interactions due to a topo- 
logically conserved charge, the monopole number. In other words, they are 
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solitons. In the quantum theory, they will then survive as particle-like exci- 
tations. Monopoles can also appear in any grand unified theory where the 
electromagnetic U(1) group is embedded in a semisimple nonabelian group. 
Therefore they can be of interest in their own right as possible particle-like ex- 
citations in the theory. We will discuss monopole solutions of a spontaneously 
broken theory briefly here. 

A question that might arise at this stage would be about why we need 
to consider spontaneously broken nonabelian theories. If we are interested in 
a theory of monopoles, why not just build a theory of Dirac monopoles in a 
U(1) theory? In fact we have already discussed some of the properties of a 
Dirac monopole and obtained the quantization condition for the charge of a 
monopole. However, the field has a singularity at the origin or the position 
of the magnetic monopole and the classical energy is infinite. This is similar 
to what happens for an electron and one could attempt to treat a monopole 
as a fundamental particle with an associated field operator and absorb the 
infinity of energy as a mass renormalization. However, until now, a completely 
satisfactory theory of charges and monopoles along these lines has not been 
constructed. 

Consider a nonabelian gauge theory with symmetry G which is spon- 
taneously broken down to a subgroup H. The potential energy is invariant 
under G-transformations of the Higgs field, and hence the finiteness of energy 
for static configurations requires that the asymptotic value of the Higgs field 
should be the vacuum value, upto a G-transformation. Since H leaves the 
Higgs expectation value invariant, we see that, asymptotically, Higgs field 
configurations are in G/H. We thus have, asymptotically, a map from S? 
(corresponding to the large radius sphere in R°) to G/H. The homotopy 
classes of such maps are given by I/2(G/H). If this is nonzero, we can have 
magnetic monopoles. The exact homotopy sequence 


(19.44) 


shows that, if G is simply connected, IJ2(G/H) = 1,(H). 

A simple model which realizes this structure is the spontaneous breaking 
of an SU(2) gauge theory down to a U(1) subgroup. The Higgs field may be 
taken as a triplet under SU(2) and the action is given by 


ier -7Rerem + 5 (Dyd)*(D*4)* — vor") (19.45) 
where 
(D,.)* a bt + ect AP f° 
V(¢°o") = : (¢°¢* — v2)? (19.46) 
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a,b,c = 1,2,3. Since the Higgs field is a triplet, the symmetry group is 
actually SO(3) rather than SU(2) and the symmetry breaking is SO(3) > 
SO(2). 


The energy for static configurations is given by 
: 1 1 
e= f oc] 5 Bret + 5D.9)"(D.9)" + VEO") (19.47) 


The classical vacuum configuration is at the minimum of V; we may take 
this as ¢* = 6*v. The isotropy subgroup is SO(2) corresponding to group 
rotations around the a = 3 direction. We have a Higgs breaking of the gauge 
group to SO(2) ~ U(1). The gauge field components W* = (A! + iA?)//2 
get mass My = ev. The unbroken SO(2) direction will be taken as the 
electromagnetic U(1) gauge group of this model. (Since ¢% are real, it is 
convenient to denote the vacuum expectation value as v rather than v/V/2.) 
For finite energy configurations, we need 


ee, (Di¢)* — 0 
b°6* > v?, , asr=|x|—0o (19.48) 


The last condition shows that as r — oo, we have a mapping ¢* : S? — S?. 
There are homotopy classes corresponding to IT2[SO(3)/SO(2)| = I2[$?| = 
Z, characterized by the winding number 


= — $ ere 6% dd? n dd° (19.49) 


where $* = ¢7/|@|. (This was discussed in Chapter 14.) Q is identically con- 
served; classical and quantum transitions must preserve Q. For the vacuum 
configuration ¢° = dv, evidently Q = 0. With the asymptotic requirement 
of D@ vanishing, we can write 


$ e2¢g2(D¢)’ A (Dd)° = 0 (19.50) 
Simplifying this equation, we find 
¢ F¢g? = = Q (19.51) 


where F? = dA® + 5 eabe AP A AS is the field-strength two-form. The unbroken 
direction is asymptotically given, in the sector with nonzero Q, by db; thus 
F%¢* is the field strength of the unbroken group. Equation (19.51) tells us 
that the total magnetic flux is proportional to @. Thus the configurations 
we are discussing are indeed magnetic monopoles, the elementary monopole 
having magnetic charge g = 1, 

For the elementary monopole configuration (Q = —1), the asymptotic 
behavior of ¢* becomes 
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¢? > -—v — (19.52) 
Finiteness of energy requires the vanishing of (D,@) as well. With the behavior 
of ¢° given above, this leads to the requirement 
a a 
A$ —> E€naik ape ont eg (19.53) 


A suitable ansatz for the fields for all r is then given by 
H 
eee =| H9) 


Ag = ae A-K®) (19.54) 


where € = evr. 

As we move in from spatial infinity, if 6* changes smoothly, with ¢? 0, 
we can use 7 itself as a homotopy parameter. If @ does not vanish anywhere, 
this will lead to a contradiction, mapping Q # 0 configuration to Q = 0, the 
latter being what we obtain as the sphere shrinks to zero size. This means 
that any configuration of nonzero Q must have a zero for the Higgs field ¢° 
somewhere. Further, K must go to 1 as r — 0 to avoid any singularity at 
r = 0 due to the 1/r? factor. Thus we have 


K-00, © as £€— 00 
K —1, as £0 (19.55) 


The static energy for this ansatz is given by 


2 
pies "8 easy + (é sid -#) +5(K?-1)? 
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e dé 
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+K°H* + qe (Ht 6") (19.56) 
The static equations of motion are 
d?K 
ae = KH? + K(K? = 1) 
DN 

2d —~9K2 2 42 

‘e = 2k H+3 H(H* — €*) (19.57) 


Exact analytic solutions of these equations are difficult, but one can find, 
numerically, solutions to these equations, with the conditions (19.55). The 
energy of the solution is generally of the form 
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ae ik f(d/e) (19.58) 


with f(0) = 1. The asymptotic behavior is of the form 
H —€ ~ exp(—Maur), K ~ exp(—Myr) (19.59) 


where My = ev and My = V2) v are the vector and scalar field masses due 
to symmetry breaking. 

Although we cannot solve equations (19.57) analytically, there is a partic- 
ular case, called the Bogomol’nyi-Prasad-Sommerfield (BPS) limit, in which 
this is possible. Notice that the energy functional (19.47) obeys an inequality 


E> / Br | ;BoBe + 5 (Did)*(Did)* 


2 
> | bx Bede)" 
: = (19.60) 


which follows from (B — Dd)? > 0. In equation (19.60) we have used the 
Bianchi identity (D;B;)* = 0 and equation (19.51). In the BPS limit, we 
consider \ — 0, still retaining the boundary condition ¢°¢° — v? as r > oo. 
The contribution of the potential term V to the energy is then zero and it is 


possible to saturate the inequality (19.60) by taking 
B? = (Di¢)* (19.61) 
The solution to this equation is 


g 
sinh € 


For this case, € = 4rv/e, in agreement with f(0) = 1 in (19.58). 

We see from equation (19.51) that the magnetic and electric charges obey 
the quantization condition eg = Q. The Dirac quantization condition was 
eg = n/2. The fact that eg is twice what is expected for the Dirac case is due 
to the fact that the starting group was SO(3) rather than SU (2). The theory 
can admit charges e/2 corresponding to the spinor representation of SO(3), 
i.e., fundamental representation of SU(2). The Dirac argument in this case 
shows that we again have eg = integer. Another way is to notice that the 
exactness of the homotopy sequence 


0 — Iz[SO(3)/SO(2)] > Ih[SO(2)] > 1,[S0(3)] + 0 


H(f) = €coth€ — 1, K(g) = (19.62) 


(19.63) 
o— Z = Z —, Ze —> QO 


shows that elements of IT2[SO(3)/SO(2)] should map onto even elements of 
IT, [SO(2)]. 
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19.3 The 1/N-expansion 


Another approach to the nonperturbative analysis of gauge theories is the 
1/N-expansion. The number of fields has been used as parameter to control 
the regrouping and resummation of the perturbation series in many theories. 
The symmetry restoring phase transition in a scalar field theory in Chapter 
18 is one example. There have also been successful nonperturbative 1/N- 
analyses in many lower-dimensional theories. For an SU(N) gauge theory in 
four dimensions with no matter fields, the property of asymptotic freedom 
shows that there is dimensional transmutation. The coupling constant can 
be eliminated in favor of a single dimensionful parameter A which defines 
the scale of the theory. Asymptotic freedom implies that modes of momenta 
much greater than A can be treated perturbatively; for modes of momenta 
comparable to or less than A perturbation theory is inadequate. There is 
no obvious expansion parameter that could be used for all modes. ’t Hooft, 
who introduced the 1/N-expansion in the context of gauge theories, observed 
that N (of SU(N) or the number of colors) may be used as an expansion 
parameter with the large N limit as the zeroth order starting point. To see 
how this is possible, we need a rule to work out the N-dependence of any 
Feynman diagram. For this purpose, it is best to consider the gauge potential 
as an (N x N)-matrix, (A,,)ij = —i(t*)ij AZ, 1,9 = 1,2, ..., N; each component 
of the field is represented by the color indices 7,7, each color index taking 
values 1,2,...,N. We are interested in large N, so we can use the gauge 
group U(N) rather than SU(N); the extra degree of freedom will have only 
a small effect as N becomes large. This will simplify our analysis. Thus t* 
are the generators of U(N) and obey the completeness relation (t*);;(t*) x = 
565K OiL- Since the ghosts are also in the adjoint representation, one can write 
Ci = c7(t")i3, Giz = C7 (t%)4;, etc. The gluon propagator is of the form 


((Ay)ag(@)(Av)ar(y)) ~ 6? nu G(a, y) ~ Fund eb (19.64) 


This shows that there is a flow of color along the propagator lines in every 
Feynman diagram and for the purpose of obtaining the leading power of 1/N 
for a diagram, we may represent each external gluon line and each internal 
propagator line by a double line as shown here. 


Fig 19.1. The double line representation of the propagator 


One can then trace the flow of color via the double lines throughout the 
diagram. Color loops represent N as coming from trace over indices like i, j. 
For example, the following one-loop diagram for gluon self-energy is seen to 
be of order e?N, where e is the coupling constant. The coupling constant 
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Or - +O 


Fig 19.2. A diagram which is of the leading order in 1/N 


will be taken to be of order 1//N, so that this diagram is of order zero in 
the large N-limit,which is the leading order in 1/N. Similarly, one can check 


Fig 19.3. Another diagram which is of the leading order in 1/N 


that the diagram given above is of the same order in 1/N because there is 
an additional color loop giving a factor of N from the trace and a factor of 
e” due to the additional vertices. By contrast, the diagram 19.4 given below, 
which has one propagator overpassing another, is of order 1/N?. 


Fig 19.4. A diagram which is of order 1/N? 


Notice that the first two diagrams, which are of order N°, can be drawn 
on a plane with no overpassing or underpassing of propagators; they are 
planar diagrams. The last diagram, namely figure 19.4, which is of order 
N~?, requires a surface with a handle for us to be able to draw it with no 
overpassing or underpassing of propagators. For diagrams with two external 
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lines, we see that the leading N-behavior is of the form N~?", where h is the 
minimal number of handles of the surface on which the diagram can be drawn. 
One can easily check by the double-line rule that this behavior persists more 
generally. We can classify all Feynman diagrams by the minimal number of 
handles of the surface on which they can be drawn with no overpassing or 
underpassing. If there are no external lines the N dependence is given by 
N2-2. For correlators of operators which are color singlets, the color indices 
i,j, etc., do not appear in the external lines. The same formula N?~2" holds 
in those cases as well. For diagrams with external lines, the general result is of 
the form ay N?~2" where ay is an N-dependent prefactor which is common 
for all diagrams with the same number of external lines. Thus generally a 
correlator is of the form 


Fan ag ae (19.65) 
h 


The 1/N expansion is an expansion in terms of the surfaces on which the 
diagrams can be drawn. In string theory, the perturbation expansion is in 
terms of the number of handles of the Riemann surface which corresponds to 
the world sheet swept out by the string as it propagates in time. The striking 
similarity between that expansion and 1/N expansion suggests that the large 
N limit of the Yang-Mills theory may be a string theory. This is also consistent 
with the dual superconductor picture, where there is a thin chromoelectric 
flux tube connecting the heavy quarks, essentially a string. However, to date, 
no one has succeeded in evaluating the planar limit of Yang-Mills theory 
which would be the lowest-order result for a string interpretation. Any result 
along such lines will shed light on confinement and other nonperturbative 
questions. In simpler models, and also for theories in lower dimensions, the 
1/N expansion has been used to obtain nonperturbative results. 

Another interesting property of the 1/N expansion is that the vacuum 
expectation value of gauge-invariant operators factorizes to the leading or- 
der. Let a1, a2 be two gauge-invariant operators. The leading terms of (a1) 
and (a2) can be represented by some set of closed planar double line loops. 
Consider now a term where we have a propagator connecting a; and ag, 
which is a term that contributes to the connected part of (a1a@2). Insertion 
of this propagator will bring in a factor of e”. In addition we have to connect 
the two color lines of the propagator to color lines in a1, ag. This means 
that one color line loop will also be lost. Thus the diagram where a1, a2 are 
connected will be suppressed by e?/N ~ N~? compared to (a1) (a2). Addi- 
tion of further propagators connecting a1, a2 will not change this behavior. 
This leads to the basic result that correlators of color singlet operators will 
factorize into a product of the expectation values each color singlet operator. 

In a pure Yang-Mills theory, the spectrum of the theory consists of color 
singlet states which are called glueballs. Color singlet operators can generate 
glueballs from the vacuum, and so, glueballs can be analyzed by studying 
correlators of color singlet operators. Having no color quantum number, the 
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basic quantum numbers of glueballs are spin (J), parity (P), and charge 
conjugation (C); there may also be additional quantum numbers character- 
izing higher-mass states of the same J?© value. Let a(z) be a color singlet 
operator and denote the leading N-dependence of (a(x)) as N%, using the 
same symbol for the power of N as the operator itself. The connected dia- 
gram for (a(x)a(y)) will be of order N2°~?; this is the main contribution to 
the propagator of the glueball. We normalize and define a glueball operator 
Oa(x) = N'~“a(a). Consider now the decay amplitude for a glueball O,(z) 
to decay into two glueballs corresponding to two other operators, Og(y) and 
O,(z). The connected diagram for (O2(x)Og(y)O,(z)) must have at least 
one gluon propagator connecting O,(r) to Og(y) and at least another prop- 
agator connecting O,(r) to O,(z). This gives a factor N~* compared to 
(a(x) G(y)y(z)), so that (Oa(r)Og(y)O,(z)) goes like N—!. Thus the decay 
amplitude for a glueball is of order 1/N at large N. This means that a glue- 
ball cannot decay into other glueballs at large N, we expect stable glueballs 
at large N. By a similar argument, the decay of a glueball into three other 
glueballs will be of order N~?; since the 1 — 3 decay amplitude and the 2 — 2 
scattering amplitude arise from the same vertex, of some hermitian effective 
Lagrangian, we can conclude that the glueball-glueball scattering amplitude 
is also of order N~?. (The existence of a hermitian effective Lagrangian leads 
to the symmetry of the scattering amplitude under exchanges of incoming 
particles with outgoing particles, along with suitable changes of momenta; it 
is known as crossing symmetry.) 

This result can be very useful on two counts. Imagine that we are able 
to solve the Schrédinger equation for the Yang-Mills theory and arrive at its 
spectrum. Then for each value of J?©, we will just find one eigenstate, since 
the higher states can decay in general and are not eigenstates. But if we do 
the analysis at large N, the higher states are also stable and we can identify 
them as eigenstates of the Hamiltonian at large N and then put in the decays 
and other corrections as a second step. Thus the 1/N expansion can help to 
sort out the states of interest. Secondly, the emerging picture is that at large 
N, we expect a sequence of states for each value of J?° with some quantum 
number characterizing the higher-mass states. The higher-mass states are 
narrow resonances with width ~ N~?. This is again very similar to what 
happens in a string theory, further giving support to the idea that large N 
Yang-Mills theory is some kind of string theory. 
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In accordance with the idea of confinement, we expect the states in QCD to 
be color singlet combinations like mesons and baryons and glueballs. What 
are the implications of large N analysis for the mesons and baryons? This is 
the question we will consider now. 
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19.4.1 Chiral symmetry breaking and mesons 


We consider Ny flavors of massless quarks in U(N) Yang-Mills theory. First 
of all, since quarks transform as the N-dimensional fundamental representa- 
tion of U(N), quark loops have only one color line if we use the double line 
representation for gauge bosons. As a result, for any correlator, each quark 
loop gives a factor of N~! in any diagram compated to similar diagrams 
which have no quark loops. Further, the axial U4(i) anomaly of QCD is not 
relevant for the following argument about chiral symmetry breaking; its ef- 
fects will be seen to be at least of order 1/N. The chiral symmetry group is 
then Uy(Ny) x Ur( Ny), neglecting quark masses and other weak sources of 
explicit chiral symmetry breaking. 

Coleman and Witten have given an argument to show that large N leads 
to spontaneous breaking of this chiral symmetry in QCD to the diagonal 
subgroup Uy(Ny). For this one assumes that chiral symmetry breaking can 
be described by a composite field of the form 


Bi ~ Oh, Qia 2 (19.66) 


Under the chiral symmetry, this transforms as  — gi Pq}, regarding ae as 
the matrix elements of the (Vy x Ny)-matrix @. The color indices a of the 
quark fields are summed over to form a color singlet. 

The effective potential V for ® must have the full chiral symmetry. The 
chiral transformations can be used to bring @ to a diagonal form with real 
nonnegative eigenvalues. (The argument is similar to what we used for our 
discussion of the CKM matrix in Chapter 12.) Thus V may be taken as a 
function of the eigenvalues of & or equivalently eigenvalues of 1S or G1. In 
V we will have some terms of the form Tr(*)™. We could also have terms 
like Tr(6')*Ty(S$1)!; terms with products of three or more separate traces 
of 6% are also possible. For the product of two separate traces, since @ is 
made of quark operators, one has to have two quark loops in the diagrams 
which generate such terms from the underlying theory, namely, quantum 
chromodynamics. Connected diagrams contribute to the effective potential, 
and so, the two quark loops have to be connected by at least one gluon 
propagator. Such terms are, therefore, down by a power of N compared to 
terms with a single trace. Similarly, products of higher numbers of traces are 
also smaller by further factors of N—!. Thus, in the large N limit, the form 
of the effective potential must be 


V=N)_ cTr(O6')" = Nf(GS") (19.67) 


where f is some N-independent function. The overall factor of N is seen to 
be correct by considering a term like (661); the summation over the colors 
of the quark operators leads to N. In terms of eigenvalues ; of 661, V = 
N >0; f(s). Assuming there is a single global minimum for the function f, 
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we see that all eigenvalues must be the same, say, Ao, at the minimum. Thus 
the expectation value of @ must be of the form diag(Xo, Ao, -°: ,Ao). The 
unbroken subgroup is thus Uy(Ny). If there is chiral symmetry breaking, 
large N considerations would show that breaking to the diagonal subgroup 
is preferred. 

Notice that if all the eigenvalues of © are different at the minimum, it 
would correspond to an unbroken symmetry of U(1)*’. Thus, in using ®, we 
are excluding breakdown to smaller subgroups. This might seem too strin- 
gent a starting point. However, there is a theorem due to Vafa and Witten 
that in any gauge theory with only vector-like coupling to quarks, parity 
cannot be spontaneously broken. Thus, if we neglect electroweak effects, the 
minimal group to which the chiral symmetry can be broken is Uy(Ny). This 
considerably strengthens the large N argument. 

A priori, it is possible that there be no chiral symmetry breaking in QCD; 
in other words, Ao could be zero. This possibility can be ruled out as follows. 
Consider any flavor current, say, a left-handed current of the form J, = 
Q1€7.Q , where € is some combination of the generators of U_(N+); we will 
take a combination for which Tr(é?) 4 0. This current has an anomaly which 
may be expressed as 


N 
1272 Tr(€*)eueypq” 


Dwa(p,a,7) = / d*ad*aq e'? +4972 (J (21) J,(x2)Jo(0)) (19.68) 


rT ywalp, q; r) =1 


where p® + q* + r* = 0. (Result (19.68) may be obtained by starting with 
nonzero flavor gauge fields, obtaining the anomaly equations and then func- 
tionally differentiating twice with respect to the gauge fields and setting them 
to zero.) This result shows that I,» cannot be analytic at p= q=r=0. 
For if it were analytic, it could be expanded in powers of p,q; but the only 
combination which has the correct power of momenta and the permutation 
symmetry of the currents is €,»ag(p? + q° + 7°), which is zero. This shows 
that the result (19.68) is not compatible with analyticity of Io at zero 
momenta. Therfore there must be zero-mass poles like p~? or q~? or r~? 
in Ive. This would correspond to a zero-mass particle whose propagator 
can lead to such a pole. J,, has to create this zero-mass state acting on the 
vacuum. Since the current is a vector, it may create a vector or a scalar 
particle. However, for massless particles, the possibility of creating vector 
particles is ruled out. A simple way to see this is as follows. If a vector par- 
ticle of momentum & and polarization is created by the current, we can 
write (k, \|J,(k)|0) = fA(k), where J,,(k) is the Fourier transform of J, (2). 
The quantity ay, = >>) fA(k) f2>(k) must be a function of k which obeys 
kop» = 0, k” ayy = 0. There is no such tensor we can construct since k, is 
a null vector. (For massive vectors of mass m, this could be nyy — kyky/m?.) 
We are led to the conclusion that J, has to create scalars. The only way a 
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current can create a massless scalar particle is due to spontaneous symme- 
try breaking, in which case J, ~ O,p, for a Goldstone boson y. Thus the 
possibility of the chiral symmetry being unbroken is ruled. 

This part of the argument, based on anomalies, is a specialization of the 
general idea we mentioned in Chapter 17, due to ’t Hooft, of finding possible 
symmetry-breaking patterns by matching anomalies in ae confined phase 
with the calculation in terms of quarks. 

Once we accept the idea of chiral symmetry breaking at large N, the 
effective Lagrangian of Chapter 12 can be used to represent interactions. 
The meson-meson interaction was found to be of order f-?. Since QQ has an 
expectation value which goes like N (due to the one quark loop), the meson 
operators are of the form N~1/?QQ. For three such operators, the leading 
diagram has one quark loop again, and so, the three-meson vertex will be of 
order N~!/2, Meson-meson scattering amplitude is of order N~}. This shows 
that we must interpret f? to be of order N. This may also be seen directly 
from the equation 


ew tke 


Vv 2WV 


by counting powers of N in a diagrammatic expansion. Equation (19.69) is 
essentially (12.103). 


(0|J,|k) = —iftky 


(19.69) 


19.4.2 Baryons 


We now turn to the question: how do we see baryons in the large N limit? 
Baryons are bound states of N quarks for a U(N) gauge theory. The following 
argument, due to Witten, can be used to estimate the N-dependence of the 
mass of a baryon. The quarks interact via potentials generated by multigluon 
exchanges. A single gluon exchange between a pair of quarks in the baryon 
will give a factor of e? ~ 1/N. Every additional gluon propagator between 
the same quarks will give e?N ~ 1, the factor of N arising from trace over 
color indices. The interaction energy between a pair of quarks thus goes like 
1/N for large N. In the baryon there are $N(N — 1) pairs of quarks, so the 
total interaction energy goes like e2N(N—1) ~ N for large N. Since N is the 
expansion parameter, we see that the baryons have energy which goes like 
the inverse of the expansion parameter. This is typical of solitons in a field 
theory where the mass of the soliton goes like 1/X, where X is the coupling 
constant. They cannot be obtained in perturbation theory; one has to look 
for nonperturbative field configurations which have stability and behave like 
particles. Thus, in the large N limit, baryons should be seen as solitons. 

We have already seen in Chapter 12 that, due to chiral symmetry breaking 
in the large N limit, the effective theory is the theory of mesons described by 


tN 
2407? 


S= fr z fat Tr(d, Uia+U) — (Tr(dU Uae 
x (19.70) 
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This is the action neglecting quark masses and electroweak gauge fields and is 
obtained by putting together (12.89) and (17.32). We will now show that this 
theory has soliton solutions; they are called skyrmions, after Skyrme, who 
found these solitons and showed that they have many of the properties of 
baryons, many years ago, before the gauge theory of strong interactions was 
invented. For the purpose of analyzing solitons, we will consider the modified 
model 


1 


i 
S= 5 fe f as OPO pee OOO PIT 


<= soe | (aU Ua (19.71) 


The extra term is known as the Skyrme term; ¢ is another coupling parameter 
in the theory. Such higher-derivative terms can in general exist since this is 
an effective action. A particular form which is convenient has been adopted 
here; its purpose is only to give a clear presentation of skyrmions. Addition of 
other possible terms will not change the essential features of the discussion. 
The energy functional for static configurations is 


E= / a E f2Tr(0,U0,U") — pamlavu-avuP)| (19.72) 


32€? 

Recall that the Wess-Zumino term, being first order in the time derivatives, 
does not contribute to the Hamiltonian. (Alternatively, being defined by a 
differential form, it does not involve the metric and so does not contribute to 
the energy.) For the vacuum configuration, we have U = 1. For configurations 
of finite energy, we need U(x) to go to the vacuum value, namely, 1, as 
la| — co. Since U goes to the identity at spatial infinity, U(x) is effectively 
a map from S° to U(Ny). The homotopy classes of such maps are given by 
IT3[U(N+)] = H3[SU(Ny)] = Z and can be labeled by the winding number 


QU] = - ae eg f TU~taU)° 


1 
~ 2472 


d°z e4*Tr(U-10,;UU-10;UU'0,U) (19.73) 


This may be taken as the charge associated with a current 


jt = = sagt” PTH(U-18,UU10,UU 1,0) (19.74) 


This current is identically conserved without the need of equations of motion. 
Field configurations with nonzero @ are the solitons or skyrmions. Time- 
evolution of a field configuration has to conserve @ and so, configurations 
with nonzero Q maintain their identity; the only way they can disappear is 
by pair annihilation of a soliton (of winding number Q) and an antisoliton 
(of winding number —Q). 
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The simplest set of maps with nonzero Q correspond to maps from S* to 
an SU(2) subgroup of SU(Ny). For Ny = 2, a spherically symmetric map is 
of the form 


U(x) = Ug(a) = exp (id(r)r - Z) 
= cos ¢(r) + i7 - €sin 9(r) J (19.75) 


where 7; are Pauli matrices. For this configuartion to be nonsingular, we 
need the boundary behavior sin (0) = 0 = sin ¢(co); thus ¢(0) are @(0o) are 
multiples of 7. The configuration (19.75) has 


Q==[60)-d(0o)} (19.76) 


By taking ¢(0) — ¢(00) = a, we get a skyrmion of winding number 1. A 
configuration with Q = 1 is given by 


2h 


—_ 19. 
R2 + r2 (aay) 


sind = 
which corresponds to the stereographic projection of a sphere onto R?. 

The actual profile of the soliton is not determined by the topological 
considerations. One could take (19.75) as an ansatz and solve static equations 
of motion (or slightly modified versions of them) to get a solution. Since this 
is an effective theory, it suffices for many purposes to treat this variationally. 
We can take a profile for ¢(r) with the required boundary values and a scale 
parameter R and calculate the energy and minimize to determine R. The 
first term in (19.72) has dimension —1, while the second term has dimension 
1; thus the general form of the energy is 


b 
E=af?R + 2R (19.78) 
where a, 6 are constants. This gives the best value of R as R = ,/b/ae?f2. 
Notice that without the Skyrme term, it is energetically favorable for the 
soliton to shrink to zero size. The Skyrme term is introduced to stabilize the 
soliton. For this purpose, any term of dimension > 1 will do; the Skyrme 
term is just a simple choice. 
In order to identify the Skyrmions as the baryons of the large N limit, 
we need to show that they carry baryon number, are fermions, and have the 
correct flavor properties. We will consider these questions now. 


19.4.3 Baryon number of the skyrmion 


A technique for identifying the baryon number of the soliton is to couple the 
theory to quarks so that the baryon number operator can be defined. We 
thus add to the effective action a term 
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Sq = Q(iy: 9)Q —m (QrLUQr + QnU'Qz) (19.79) 


We have included a mass term for the quarks. Since quark has baryon number 


1/N for QCD with SU(N) symmetry, the baryon number operator is defined 
by 


B= x i, Pr QyQ (19.80) 


The strategy for identifying the baryon number of the soliton is then to 
calculate the expectation value of B for the soliton state, which corresponds 
to having a background value for U given by the soliton configuration. Thus 
we calculate 


(B)s = x; f az (S107°QI5) 
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The second line expresses the baryon number in terms of the short-distance 
behavior of the fermion propagator in the presence of the background U. 
Since only the short-distance behavior is needed, we can obtain it by an 
expansion in powers of 1/m. (It was for this reason that a quark mass term 
was introduced.) 

The propagator obeys the equation 


(iy-9—mF)S(z, y) = 16 (x — y) (19.82) 


where F = 3{U(1— +5) + U'(1 + 45)]. The solution to this may be written 
as S(x,y) = —(iy-0+ mF") G(z,y), where G is defined by 


(+ m? — imy- (OF")) G(a,y) = 15 (a — y) _ (19.83) 
We have used the properties FF' = 1 and y,F! = Fy. We will expand G 
in powers of X = imy- OF" by iterating the integral equation 
Gl2,y) = Gole,y) -i f Gola, 2)X@)G9) 
z 
d‘p a : 
=-i | — = _ _ 19.84 
Go(z, y) U (27)4 p — m2 + te exp| ip(z y)] ( ) 
In the expression for the expectation value of the baryon current, the relevant 
term for us is 


(J#(a)) = 2s [y(iy- 8+ mF*G(2,y)] 
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The factor of 1/N is canceled by the trace over the N colors of quarks. 
There are other terms which can contribute to the current. The terms which 
are potentially divergent give zero by Dirac and SU(N) trace identities. We 
have neglected terms involving derivatives of X in passing from the first to the 
second line of this equation; such terms could give additional contributions. 
The term we have displayed leads to a baryon charge given by the winding 
number Q. The density for Q is a closed but not exact three-form belonging 
to an element of H?[SU(Nf)]. Since this term is unique, all the other terms 
in (J“) have to be of the form 0, A”, where K” is antisymmetric, so that 
we have conservation of the baryon current. In the expression for the charge, 
such terms integrate to zero. Thus, even though one can have corrections 
to the expression we derived for the current, such terms are not needed for 
the charge. (This is why we calculated only the one required term in (J*). 
The key to our derivation is to show that a baryon number equal to Q is 
generated, not that there are no other terms in the current.) We have thus 
shown that skyrmions have baryon number which is given by the winding 
number. 


19.4.4 Spin and flavor for skyrmions 


For discussing the flavor and spin properties of the soliton, we will just con- 
sider Nz = 3 and N = 3; there is no substantial difference for higher numbers 
of flavors. The argument presented below is essentially due to Witten. 

The soliton configuration (19.75), for N+ = 3, is 


Uae Gar - £) 7 (19.86) 


This configuration is not invariant under rotations of x, but is invariant 
under combined rotations and SU(2) transformations generated by the 7’s. 
Therefore, we see that spatial rotation may be taken to be given by Us — 
GUsGt, where G is an SU(2) transformation of the form 


G- ey - 6) - (19.87) 


To begin with, the theory has symmetry SUy(3) x SU (2), where SU;(2) 
corresponds to rotations; but the configuration (19.86) does not have this full 
symmetry. In the quantum theory, quantum fluctuations of the configuration 
(19.86) occur and among these there are some which correspond to moving 
by (spatially constant) SUy(3) x SU ;(2) transformations. Since this is a 
symmetry of the theory, the static energy does not change. One can then take 
the parameters of such transformations to be time-dependent and quantize 
fluctuations around (19.86) by just keeping these modes. This is the first step 
in a semiclassical quantization of the soliton and will suffice to show its spin 
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and flavor properties. The ansatz for the soliton, with the constant flavor 
rotations, is 


U(z,t) = A(t) Us Al (t) (19.88) 
where A(t) is an SU(3) matrix. The symmetries are now realized in the 
following way. 


Al(t) = V A(t), A'(t) = A(t) G (19.89) 


where V € SUy(3) will give the flavor symmetries and G gives spin or ro- 
tations. We may define a set of differential operators which are infinitesimal 
generators for these transformations as 


be A stg A; Re AA ta (19.90) 
Writing A = exp(itay*) for some parameters y’, a,i = 1,2,---,8, we can 
identify 
L, =—i(E7? ‘aa fin Shee Sa (19.91) 
where 
dAAl = —it, Edy’, AldA = —itgE%dy' (19.92) 


To obtain the quantum theory, we now substitute (19.89) into the action 
(19.71) to obtain 
S= / dt -Semeatanayy =. (Trt. AtA)P = ie Tilted dA) 
(19.93) 
where t’s are SU(3) matrices and 1 = 1,2,3 ,k = 4,5,6,7. a and @ are some 
constants arising from the spatial integrations whose precise values are not 
important here. Notice that there is no quadratic term involving Tr(tg A‘; A). 
In quantizing, the time-derivatives of Tr(tgA'QA) for a = 1,2,---,7 will be 
related to the differential operators Lg or Ra. The wave functions will be 
functions of the SU(3) matrix A and the action of various operators will 
be defined in terms of L, and R,. But since there is no quadratic term for 
Tr(tsA'd,A), we do not get a canonical momentum for this direction. Under 
A-— Aexp(itg@), the action changes as 


NQ; 
S(Aew#?) =S + 19.94 
(ae) 9 2/3" ( ) 
This means that the wave functions must have the property 
W (Ae?) = WA Gr) 19.95 
(Ae'#*®) = (A) exp (15 (19.95) 
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This puts a constraint on the allowed wave functions. (The fact that the wave 
functions must obey such an equation if the action changes as in (19.94) is 
easily seen in a path-integral representation for the wave function.) 

The wave functions can be generally written as 


W(A) = Cr D®(A)z,1,¥3',1,,¥" (19.96) 


The D ’s are the matrices representing the group élefnent A in some rep- 
resentation R, which is a priori arbitrary. (D’s are the Wigner D-matrices 
for SU(3).) Cr is a normalization constant. The matrix labels are speci- 
fied as follows. L? + L2 + L? = I(I + 1) is the quadratic Casimir for the 
SU(2) subgroup of the left action of SU(3) on A. Iz is the eigenvalue of 
L3 and Y is the eigenvalue of (2//3)Lg. (This is the hypercharge.) I’, Ii, 
and Y’ refer to the corresponding quantities for the right action R, on A; 
thus R? + R23 + R2 = I'(I' +1), Rs = I, and (2//3)Rg = Y’ when acting 
on DF(A) I,Is,¥3I',1,,¥’- These are the quantum numbers needed to specify 
a state within an SU(3) representation. Notice that A, R2,R3 define the 
rotation group for the soliton, so that I’ = J, the spin of the soliton in the 
quantum theory. 

The constraint (19.95) tells that we must choose Y’ = 1 for N = 3 and 
Q = 1. The lowest-dimensional representations of SU(3) containing any state 
with Y’ = 1 are the (8-dimensional) octet, which is the adjoint representation, 
and the (10-dimensional) decuplet. For the octet, the state with Y’ = 1 has 
a= 4. The left indices show that these fall into a flavor multiplet which 
is an octet; since Y’ is fixed, there are only two values for the right indices, 

— +3. Thus we get an octet of spin-3 soliton states. For the decuplet 
f= f= 3, leading to a flavor decuplet of solitons with spin-3. Thus, this 
theory leads to a set of solitons carrying baryon number 1, whose low-lying 
states are a spin-5 octet and a spin-3 decuplet. This is in complete agreement 
with the quark model, which was discussed in Chapter 12, and with what is 
observed. The soliton, which is a coherent field configuration of bosonic fields, 
has become a fermion due to the effect of the Wess-Zumino term. 

For two flavors, there is no Wess-Zumino term, since H°[SU(2)] = 0. 
Since soliton configurations are maps from S* to SU(2), we find that I,[C] = 
IT,|SU(2)] = Zz, where C denotes one component of the configuration space. 
Thus C can support double-valued wave functions leading to the possibility of 
fermions. This was already noted by Skyrme many years ago. The extension 
to SU(3) gives a concrete realization. One may even regard the embedding 
in SU(3) as similar to our argument for the global anomaly of SU(2) and 
one can obtain the fermion nature of the solitons by a modification of that 
argument. 

There are a large number of papers which show further properties of 
baryons from the soliton point of view and explore the Skyrme model as a 
complementary picture of baryons in the standard model. 

In conclusion, we see that the large N theory can reproduce all the ex- 
pected features of the particle spectrum, lending further support for the 1/N 
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expansion as a sensible way to analyze the nonperturbative aspects of the 
gauge theory. 


19.5 Lattice gauge theory 


19.5.1 The reason for a lattice formulation 


A very important approach to the study of nonperturbative features of a 
nonabelian gauge theory is lattice gauge theory. The basic idea here is the 
following. Quantum field theory can be described by the Euclidean functional 
integral. This integral has to be defined by first considering the theory trun- 
cated to a finite number of modes, calculating the finite dimensional ordinary 
integral and then taking the limit as the number of modes goes to infinity. In 
the lattice approach, the truncation to a finite number of modes is made by 
considering the theory on a finite lattice which is usually taken, for simplicity, 
as a simple cubic lattice of appropriate dimensions. Thus in four dimensions, 
one considers a four-dimensional hypercubic lattice with M lattice points 
along each direction with a lattice spacing a. The field variables are located 
at each lattice site or on the link connecting two nearby lattice sites, so that 
the total number of field variables is finite, proportional to M*. Since the 
shortest possible distance between two points is a, there is no ultraviolet di- 
vergence, a serving as the cut off parameter. The continuum theory, which 
is what we are interested in, can be obtained as M — oo, a — 0. Rescaling 
of fields and coupling constants and renormalization of parameters would be 
needed in extracting this limit. The four-dimensional Euclidean invariance 
can also be regained in this limit. 

This approach is important for many reasons. On the theoretical side, the 
theory on the lattice gives a mathematically well-defined functional integral 
and correlators and so it is a good starting point for investigating questions 
like the existence of a field theory in a fully nonperturbative sense. This is 
the approach of constructive field theory and indeed this has been used to 
argue that the ¢*-theory in four dimensions, despite its great usefulness in 
perturbative calculations, may be trivial. (Basically this is the statement that 
if the bare theory is formulated on the lattice with bare parameters taken to 
be finite as a — 0, then the continuum theory will be a free theory.) 

On a more practical side, the truncation of the functional integral to a 
finite dimensional integral leads to the possibility of numerically evaluating 
the integral for a set of values of (M,a) and then extrapolating the results 
to the continuum limit M — oo, a — 0, to obtain numerical estimates of 
physical quantities. The approach is intrinsically nonperturbative and with 
the increasing power of electronic computers, this becomes more and more 
practical. Since it is possible to formulate gauge theories on a lattice man- 
ifestly preserving gauge-invariance, the lattice approach can be particularly 
useful for gauge theories. In practice, we still have to do a lot of analytical 
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work formulating the lattice versions of physical quantities and a procedure 
for estimating them as well as developing a number of specialized numerical 
algorithms for implementing these calculations via computers. A large body 
of work has been built up over the years. Many qualitatively general features 
of gauge theories such as confinement and the deconfinement transition at 
finite temperatures can now be analyzed in a reasonably reliable way. There 
have also been some estimates of glueball masses. — 

In this section we will review the most elementary features of a lattice 
gauge theory, considering a theory with no spontaneous symmetry breaking. 


19.5.2 Plaquettes and the Wilson action 


The basic gauge-field variables are defined on links connecting nearby lattice 
sites. They are elements of the Lie group SU(N) for an SU(N) gauge theory 
and are denoted by U(x + yw, x), where x denotes a lattice point and z+ p 
the nearby lattice point obtained from x by translation by an elementary 
lattice vector (of magnitude a) in the direction y. We may take U’s to be 
in the fundamental representation of SU(N) and consider them as (N x N)- 
matrices. On the cubic lattice we then consider the faces of the unit cells, 
one example of which would be the square with vertices x7, © + pp, r+ pu+y, 
x +v. The faces of the unit cells are called plaquettes; they provide a tiling 
of any surface we can form in the lattice. The action is defined in terms of 
the trace of the product of U’s around the edges of a plaquette. We define a 
plaquette variable 


Wp(2) =W(z,2+p,2+p+v,2+0) 
=T|U@.2+¥) U(rg+vu,r+pyt+v) 
Uetutyetu) U(e+m2)] (1997) 


The simplest version of the lattice action for a gauge theory, first given by 
Wilson, is then 


S= 22 f - y ReW(2)| 
= 6 Soe 
P 
S' = -£ > ReWp(z) (19.98) 
P 
The sum is over all plaquettes P in the lattice and @ is related to the bare 
coupling constant e9 by B = 2N/e?. 


Gauge transformations are SU(N)-valued functions and are defined at 
the lattice sites. Thus, under a gauge transformation 
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U(x +p,2) > U9(x + p, 2) = g(x + pw) U(a + p, 2) gt (zx) (19.99) 


where g € SU(N). Wp(z) is invariant under such a transformation and so is 
the action (19.98). 

The relationship with the continuum formulation is as follows. U(x+ p, 2) 
is the lattice version of the parallel transport matrix; this is clear from the 


transformation law (19.99). For small lattice spacing, the gauge potential can 


be defined by 
r+ 


U(x+p,2) + Pexp Gi 
z 


W p(x) is then the Wilson loop operator where the loop corresponds to the 
boundary of the plaquette P and so 


be 
t7A2 ac" (19.100) 


Wp(x) © Tr [es rao] (19.101) 


a” is the area of the plaquette and equation (19.101) is valid for small areas, 
or as a — 0. We thus find 


~ ae a Tayv 
Se / d*a qe hw F (19.102) 
The summation over plaquettes, with a factor of a+, becomes the integral over 
the Euclidean spacetime volume. Equation (19.102) agrees with the contin- 
uum action, showing that (19.98) does indeed correspond to the Yang-Mills 
theory. This limit is often refered to as the naive continuum limit since the 
actual continuum limit of the theory defined by evaluating the functional 


integral and then taking M — oo, a — 0 may give a very different behavior. 
The functional integral is defined by 


Z= yf [] 4(V) exp(-s) (19.103) 


links 
where du(U) for each link is the Haar measure for the group SU(N). This 


is defined as follows. The Cartan-Killing metric on SU(N), considered as a 
differential manifold, is given by 


ds? = —2 Tr(U-'dU)? = Ef Etdy' dy? (19.104) 
where U-1dU = —it? E2(y)dy', yp’ being group parameters in terms of which 
we can write U; for example, we can take U = exp(—it*p"). (In (19.104) we 


have the symmetric product, not a wedge product.) The volume element 
corresponding to (19.104) is 


du(U) = (det E) | [ de’ (19.105) 


This is the Haar measure for the group. 
There are two properties of du(U) which will be useful. 
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1. The total volume of the group defined by 


vol[SU(N)] = / du(U). (19.106) 
is finite. 
2. Representation matrices are orthonormal with respect to the Haar mea- 
sure, i.e., 
(R)rptyp(R) yy — VOUSUCN)] sar’ 
[au) DU IO) = ER Oe oeadas (19.107) 


where Dw ) is the matrix representation of the group element U in the 
irreducible representation R. A, B take values 1,2,---,dimR; a, § take 
values 1,2,---,dimR’. (These are the Wigner D-functions for SU(N).) 

In the functional integral (19.103), in integrating over all U’s, we end up 
integrating over the directions in configuration space corresponding to gauge 
transformations as well. Since there are a finite number of points, this does 
not give a divergence and the volume of gauge transformations gets divided 
out in expectation values. 


Strong coupling expansion 


The total volume of integration in (19.103) is finite, so one may expand 
the exponential exp(—S) = exp(—{ >> p 1) exp(—S’) in (19.103) as a series in 
S’ and evaluate integrals term by term to get a series in 1/e2. This is referred 
to as the strong coupling expansion and, a priori, would seem reasonable for 
large eg and fixed a. €9 is the bare coupling constant for the lattice with 
lattice cut-off a and so this expansion is appropriate for distance scales of the 
order a. Asymptotic freedom tells us that the effective coupling increases with 
separation of charges, so that to get a finite coupling at a given separation 
as a — 0, we must tune €9 to smaller The strong coupling expansion is thus 
not appropriate for the continuum limit where a — 0. Nevertheless, some 
interesting features emerge from such an expansion. 

The lowest-order term in the expansion of the exponential will give a 
constant f []iinn5 4u(U) = {vol[SU(N)]}"“, where nz is the total number of 
links. In the next term, which is of the form { du(U) S’, we encounter integrals 
like f dU(x+ , 2) U(x+ , x), which is zero by (19.107); thus f du(U)S’ = 0. 
To get a nonzero value we must have the product of U(x + y,2x) and its 
conjugate U(x, + y) in the integrand. This can arise from a product of 
Wp’s for nearby plaquettes with one common edge (x + y, 2), for example, 
the plaquettes (v,r+u,r+p+v,c4+v) and (2,2+p,2+p+a,r2+a). 
Integration over U(r+p, x) using (19.107) then replaces U(z+p,x)U (zx, +p) 
by a constant, giving a surface made of two plaquettes glued together along 
the edge (x + y,x) and a remaining Wilson loop along the boundary of 
this surface. In [ du(U)S’? we will find such a possibility, one factor of S’ 
contributing U(x + w,x) and the other giving U(z,2 + uy). f du(U)S” is 
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still zero since there are remaining integrations over link variables with no 
matching conjugate element. Continuing in this way, it is easy to see that 
the only way we can get conjugate elements for each link variable is if the 
plaquettes under consideration form a tiling of a closed surface ©. We can 
have a factor of Wp for each plaquette on the closed surface which comes 
from a corresponding factor of S’ in the expansion of the exponential; the 
edges would match, giving pairing of the link variables. In this case, the 
integral will be proportional to {1/e2}"(~) ~ exp [n(2) x constant], where 
n(%’) is the number of plaquettes for the surface Y. The functional integral 
thus reduces to some kind of summation over closed surfaces with a weight 
determined by an action proportional to the number of plaquettes or the 
area of the closed surface. The surfaces inolved are not necessarily simple; 
for example, one can have surfaces with different branches sharing common 
links. Nevertheless, this structure is again reminiscent of string theory where 
the functional integral is given as a summation over surfaces. 

A curve C in the lattice can be considered as formed by a sequence of 
links 1,,12,---,l«% and the Wilson loop operator over C’ can be defined in 
terms of the product of U’s along these link segments. Thus 


Woe = Tr (Ui, Ui, ++ Uix) (19.108) 


In the strong coupling expansion, we can see that the expectation value of 
the Wilson loop will give an area law. Consider 


(Wc) = ii du(U) Wo e 5 (19.109) 


In matching the U’s to get nonzero contributions, we see that we can now 
match the U’s from the expansion of e~° with the U’s from the Wilson 
loop. The integration then reduces to summation over open surfaces with the 
boundary C. Since the minimal number of plaquettes for tiling this surface is 
proportional to the minimal area of a surface with boundary C, we see that 
the behavior of the Wilson loop is 


A(C 
(Wo) ~ exp (- x ) log e? (19.110) 
We get an area law, but this is in the strong coupling expansion and does not 
provide any argument for confinement as a — 0. The general expectation is 
that for dynamical reasons, this behavior will persist to the continuum limit. 


19.5.3 The fermion doubling problem 


Lattice versions of matter fields can also be introduced in a simple way. 
Matter fields are associated with lattice sites. Thus, for a scalar field, we 
introduce a field variable ¢(x) at every lattice point x. Derivatives can then 
be defined by 
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0.6 = —— (19.111) 


It is then possible to construct actions of the form 


2 we 
S= Ee E (Sete ot) + Catt) + 20%a] (19.112) 


In the small a limit, this goes to the continuum action 
eee 
a / ae 15 (36)? + Te + rét (19.113) 


In the case of fermions, a similar procedure can be carried out but leads 
to the so-called fermion doubling problem. Any lattice action for fermions 
with locality of interactions, hermiticity of Hamiltonian and translational 
invariance on the lattice will give a continuum theory with an even number 
of fermion fields with pairs of fields of opposite chirality. This means that 
we cannot set up a lattice theory if the desired continuum theory is chiral in 
nature, such as the standard model, unless we give up some of the conditions 
mentioned above. 

The doubling problem can be illustrated in a simple way by considering 
massless fermions in one spatial dimension; we will also use a Hamiltonian 
approach. First we shall consider the continuum theory for which we have 
the Dirac equation 


Pac oy A (19.114) 


We use the y-matrices, y° = 01, y! = io2 and take the chirality operator as 
—7°71 = o3. For left chirality, the Dirac equation becomes 


4 (19.115) 


so that, with the substitution  ~ exp(—ipot + ipr), we find pop = —p. Thus 
negative values of p lead to the positive energy solutions, pp = wp, = |p|; 
upon quantization, these will be associated with the annihilation operator 
for particles. Notice that these are left-moving modes on the line. Positive 
values of p lead to pp = —wp = —p, corresponding to the creation operator 
part. For right chirality, we get po = p, with positive values of p, or right 
movers, corresponding to particle annihilation. There is thus a correlation 
between chirality and left- and right-moving modes. 

Consider now one chirality, say, the right one, on the lattice. Keeping time 
as a continuous variable still, the expected equation is of the form 


im +i (Het — ve) =0 (19.116) 


a 
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The second term will correspond to an expression —i }>,, #'(x)[b(x + a) — 
#(x)] in the Hamiltonian operator. This is not hermitian by itself. The hermi- 
tian form which also gives terms which are first order in the spatial derivatives 
asa — 0 is >, ${-i (x) [W(x + a) — ¥(x)] + h.c.}. This choice leads to the 


equation 
Ov, .fv(et+a)—a(x—-a) 
We solve this by introducing a mode expansion of the form 
ey) ep Pla: (19.118) 
P 


Periodic boundary conditions are the easiest to use for the lattice, so that for 
a lattice of length Ma we have 7(xz + Ma) = (x). This gives the possible 
values of p as 27m /Ma, where n is an integer. Since x is an integer multiple 
of a, exp(ipm4nZ) = exp(ipnx + 2nix/a) = exp(ipnxr). Thus there are only 
M independent modes, which we may take as 


2 es 
a es ee ee 


(19.119) 


(We will take M to be odd for simplicity.) The points +(M — 1)/2 are the 
edges of the Brillouin zone, giving the highest possible values for |p| on the 
lattice. Using the mode expansion (19.118) in the Dirac equation (19.117) we 
get 

_ sin(pa) 


: (19.120) 


Po 


For modes near n = 0, p is small and as a — 0, we get po = p corresponding 
to a massless mode of right chirality. However, we can also consider modes 
near the edge of the Brillouin zone. For modes near (M — 1)/2, we can write 
p = (27/Ma)(M — 1)/2— , p > 0; we then find sinpa = sinpa as M — oo. 
This leads to another mode with pp = p as M — oo, a — O. In the mode 
expansion, this is of the form 


(a) ~ e'™*/* SN exp(—ipt — ipa) ag (19.121) 
p 


Thus e~*7*/%)(zx) is another field in the continuum which behaves as a mode 
of momentum —p and po = 9; it is thus part of a field of left chirality. The 
other half of modes for this field comes from modes near —(M — 1)/2, for 
which we may write pa ~ —1 + pa as M — oo. In this case, the momentum 
will be p and pp = —p consistent with left chirality again. Thus, even though 
we started from the lattice version of what appeared to be a purely right 
chirality field, we see that we end up getting both chiralities as M — oo, 
a — 0, due to extra contributions from modes near the edge of the Brillouin 
zone. This is the essence of fermion doubling. 
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This result is quite general. It can be formulated in terms of the action 
with latticization of the time direction as well. For every direction, we get 
both chiralities, leading to a multiplicity of 24 = 16 in four dimensions. The 
generality of this doubling is captured by the Nielsen-Ninomiya theorem. Let 
the equation for the lattice Dirac field be of the form 


ce 


ie = He voy) (19.122) 


We consider the following three conditions on the lattice theory: 


1. The theory is local in the sense that the Hamiltonian H(z — y) goes to 
zero as |x—y| — oo such that the Fourier transform of H has a continuous 
first derivative. 

2. The action and Hamiltonian have translational invariance under transla- 
tions by lattice vectors. 

3. The Hamiltonian H(z — y) as a matrix, with z,y as labels for matrix 
elements, is hermitian (or the action is real). 


The Nielsen-Ninomiya theorem then states that given a lattice fermion theory 
obeying these three conditions, the fermion modes are in pairs of left and right 
chirality, as M — co, a > 0. 

This theorem shows that it is rather difficult to construct lattice theories 
for chiral fermions. One is forced to give up one of the three premises of the 
theorem. A number of different approaches have been proposed for construct- 
ing chiral theories on the lattice. While they are important for the standard 
model and for questions of chiral symmetry breaking for QCD, they generally 
lead to a considerable increase in computational complexity. With presently 
available computers, we are still only at the threshold of fully dynamical 
numerical computations relevant to such questions. 
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20 Elements of Geometric Quantization 


This is a special chapter on some aspects of geometric quantization. The first 
part of this discussion has overlap with Chapter 3 on canonical quantization. 
In Chapter 15, we considered some topological issues in quantization from a 
path integral point of view. The more canonical point of view of the same 
topological features is discussed here. Some examples from mechanics, as well 
as from field theory, are given. The example of the gauge theory configuration 
space has significant overlap with Chapter 16, but is included, and partly 
repeated, here to show how it fits in with the canonical framework. 


20.1 General structures 


1. Symplectic form, canonical transformations, and Poisson brackets 


We shall first consider the formulation of theories in the symplectic lan- 
guage. The question of deriving this from an action formulation will be dis- 
cussed later. 

In the analytical formulation of classical physics, one starts with a phase 
space, i.e., a smooth even-dimensional manifold M endowed with a sym- 
plectic structure 92. 2 is a differential two-form on M which is closed and 
nondegenerate. This means that d(2 = 0, and further, for any vector field € 
on M, if ig§2 = 0, then € must be zero. In local coordinates q’, on M, we 
can write 


= ; Qyvdq" A dq” (20.1) 
The condition df2 = 0 becomes 
a= 1OQyy gga A dq’ A dq” 
20g" 
=— i a as Ge, 
3! | Og@ Fe Oq’ a Ogu S : ? 
=0 (20.2) 


The interior contraction of 2 with a vector field € = £“(0/0q") is given by 
te 82 = EF Quy dg” (20.3) 
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The vanishing of this is the condition €“2,, = 0. Thus if 2 is degenerate, 
2,,,, considered as a matrix, has a zero mode €4. Nondegeneracy of {2 is thus 
equivalent to the invertibility of 2, as a matrix. When needed, we denote 
the inverse of §2,,, by 24”, i.e., 


Quy QO = bf (20.4) 


onterd 
(We will consider 92’s which are nondegenerate. If (2 has zero modes, one 
has to eliminate them by constraining the variables, or equivalently, one has 
to project 2 to a smaller space where there is no zero mode and use this 
smaller space for setting up the quantum theory. Such a situation occurs in 
gauge theories. In general, zero modes of {2 indicate the existence of gauge 
symmetries. ) 

With the structure (2 defined on it, M is a symplectic manifold. 

Since 2 is closed, at least locally we can write 


QN=dA : (20.5) 


The one-form so defined is called the canonical one-form or symplectic po- 
tential. There is an ambiguity in the definition of A, since A and A+dA 
will give the same 2 for any function A on M. As we shall see shortly, this 
corresponds to the freedom of canonical transformations. 

There are two types of features associated with the topology of the phase 
space which are apparent at this stage. If the phase space M has nontrivial 
second cohomology, i.e., if H7(M) 4 0, then there are possible choices for 
§2 for which there is no globally defined potential. The action, as we shall 
see later, is related to the integral of A, so if 22 belongs to a nontrivial 
cohomology class of M, then the definition of the action requires auxiliary 
variables or dimensions. Such cases do occur in physics and correspond to the 
Wess-Zumino terms discussed in Chapter 17. They are intrinsically related 
to anomalies and also to central (and other) extensions of the algebra of 
observables. 

Even when H?(M) = 0, there can be topological problems in defining A. 
If H1(M) # 0, then there can be several choices for A which differ by elements 
of H!(M). There are inequivalent A’s for the same 2. One can consider the 
integral of A around closed noncontractible curves on M. The values of these 
integrals or holonomies will be important in the quantum theory as vacuum 
angles. The standard 6-vacuum of nonabelian gauge theories is an example. 
We take up these topological issues in more detail later. 

Given the above-defined geometrical structure, transformations which 
preserve {2 are evidently special; these are called canonical transformations. 
In other words, a canonical transformation is a diffeomorphism of M which 
preserves (2. Infinitesimally, canonical transformations are generated by vec- 
tor fields € such that L¢{2 = 0, where L¢ denotes the Lie derivative with 
respect to €. This gives 
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Led = (dig + te d) 2 
= d (ie) 
=0 (20.6) 


where we have used the closure of 2. For canonical transformations, ig is 
closed. If the first cohomology of M is trivial, we can write 


igQ = —df (20.7) 


for some function f on M. In other words, to every infinitesimal canonical 
transformation, we can associate a function on M. If H!(M) 4 0, then there 
is the possibility that for some transformations €, the corresponding ig? is a 
nontrivial element of H1(M) and hence there is no globally defined function f 
for this transformation. As mentioned before this is related to the possibility 
of vacuum angles in the quantum theory. For the moment, we shall consider 
the case H!(M) = 0. Notice that for every function f we can always associate 
a vector field by the correspondence 


éH = 26, f (20.8) 


Thus when ‘(M) = 0 there is a one-to-one mapping between functions on 
M and vector fields corresponding to infinitesimal canonical transformations. 
A vector field corresponding to an infinitesimal canonical transformation is 
often referred to as a Hamiltonian vector field. The function f defined by 
(20.7) is called the generating function for the canonical transformation cor- 
responding to the vector field. 

Let €, 7 be two Hamiltonian vector fields, so that LgQQ = L, 2 = 0 and 
let their generating functions be f and g, respectively. Since Lg L,, — L, Lg = 
Li¢,n}, the Lie bracket of € and 7 so defined is also a Hamiltonian vector field. 
The Lie bracket of € and 7 is given in local coordinates by 


[E, n]* = €”O,n* — "OE" (20.9) 


We must therefore have a function corresponding to [£, 7]. This is called the 
Poisson bracket of g and f and is denoted by {g, f}. (There is a minus sign 
in this correspondence; £ ~ f, 7  g and [£,7] ~ {g,f}.) We define the 
Poisson bracket as 


{fg} = tgin ® = 1S Quy 


—igdg = i, af 
= 2” 0, fag (20.10) 


Because of the antisymmetry of 92,,, we have the property 


{f,9} = —{9, f} (20.11) 
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From the definition of the Poisson bracket, we can write, using local coordi- 
nates, 


2 Oat f,9} = Oa(n - OF — €- Og) 
= aN On f a n’ (OOF) = Dab" Ong = E" (0,009) 
= Oan Ou f — Aub" Ong +7 - OE" Day) —€ : O(n Lay) 
= Oa Ouf — Oak"Oug + (E- On — 9 OE)" Que 
te ne’ (O,Qav aE OySapen) 
= [€, mn)" Que + Oa (mY E” Quy) + ME” (Op Qav + Oy Qype + Oa By) 
= [es TP Que a Oatf, 9g} —F mE” (Op Qe a Oy Que oe On Qi) 
(20.12) 


In local coordinates, the closure of 92 is the statement 0,2av + QQya + 
Oa 2 yp, = 0. We then see that 


—d{g, Jet = te,n) ‘0; : (20.13) 


which shows the correspondence stated earlier. 

Consider the change in a function F’ due to a canonical transformation 
generated by a Hamiltonian vector field € corresponding to the function f. 
This is given by the Lie derivative of F with respect to €. Thus 


6F = £40, F = {F, f} (20.14) 


The transformation is given by the Poisson bracket of F with the generating 
function f corresponding to €. 

An important property of the Poisson bracket is the Jacobi identity for 
any three functions f, g, h, 


{f,{9,h}} + th, {f, 9}} + {9, {h, f}} =0 (20.15) 


This can be verified by direct computation from the definition of the Poisson 
bracket. In fact, if €,7, are the Hamiltonian vector fields corresponding to 
the functions f,g,h, then 


{f, {9 A}}+ {h, {f, 9}} + (9, {h, f}} = —teintpd 2 (20.16) 


and so the Jacobi identity follows from the closure of (2. This result is also 
equivalent to (20.13, 20.14). Acting on a function F 


(LeLy = L,L¢)F an {{F 9}; f} a ae, ft, 9} 
a Sa Hee F} 
oe (20.17) 


where we have used the Jacobi identity in the second step. 
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The change of the symplectic potential A under an infinitesimal canonical 
transformation can be worked out as 


bA=LeA 
= d(igA— f) (20.18) 


where we used the definition Le = (d ig + i¢ d) and equation (20.7). Thus 
under a canonical transformation A — A+dA, A = igA— f. Evidently 
dA = §2 is unchanged under such a transformation. This suggests that we 
may think of A as a U(1) gauge potential and (2 as the corresponding field 
strength. The transformation A — A+ dA is a gauge transformation. 

Finally we note that the symplectic two-form defines a volume form on 
the phase space M by 


QAKQNA---KL 
(27)” 


2 2n 
= ar aee (=) d°"q (20.19) 


where we take the n-fold product of §2’s for a 2n-dimensional phase space. 
(c is a constant which is undetermined at this stage.) If the dimension of the 
phase space is infinite, then a suitable regularized form of the determinant 
of (2, has to be used. The volume measure defined by equation (20.19) is 
called the Liouville measure. 

Finally, notice from the last line of equation (20.10) that for the phase 
space coordinates we have {q",q”} = 24”. This is often interpreted as saying 
that the basic Poisson brackets are the inverse of the symplectic structure. 


do(M)=c 


2. Darboux’s theorem 


A useful result concerning the symplectic form is Darboux’s theorem 
which states that in the neighborhood of a point on the phase space it is 
possible to choose coordinates p;,z;, i = 1,2,---,n, (which are functions of 
the coordinates g“ we start with) such that the symplectic two-form is 


Q = dp; A dz; (20.20) 


Evidently from the above equation, we see that the Poisson brackets in terms 
of this set of coordinates are 


{z;,x;} =0 
{xi, pj} = 0:3 
{p:,p;} =0 (20.21) 


This is the standard separation of the phase space coordinates into momenta 
and configuration space coordinates; Darboux’s theorem is clearly an impor- 
tant result. The standard proof of the theorem is by induction; an outline is 
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as follows. (This is very elegantly discussed in Arnold’s book.) Let B denote 
the point on phase space in the neighbourhood of which we want to reduce 
Q to the Darboux form. As the first coordinate p;, we can take any noncon- 
stant function of the coordinates q such that its differential dp; is not zero 
at B. One can assume, without loss of generality, that p; = 0 at the point 
B. Associated to pi(q) there is a Hamiltonian vector field 


Opi 

jie 

P= La (20.22) 
We now choose a (2n — 1)-dimensional surface X’ transverse to the vector P; 
and passing through the point B. The equation 


dq" _ _ gu 9P1 

rs 2 aq? (20.23) 
defines the flow due to the vector field P,; these flow lines intersect the surface 
+) transversally. Consider any point q near 2’ but not necessarily on it. We 
can solve (20.23) with q as the initial point and choose the direction such 
that the motion is toward the surface »’. At some value 7 determined by the 
initial point q, this motion arrives at +’. This particular value of 7, viewed 
as a function of the initial point qg, we denote by x;. Equation (20.23) shows 
that for this function x(q), we have 


{x1,p1} i - (20.24) 


(Since we are considering 7 as a function of the initial value gq, there is an 
additional minus sign in differentiations. g’ on the left-hand side of (20.23) 
is the moving point, the derivative 07/Oqj,,;,;,, is what we want for the Pois- 
son bracket. This eliminates the minus sign in (20.23).) p1,21 give the first 
canonical pair of coordinates. Notice that 2; = 0 for points on ». 

Consider the surface ©* defined by p,; = 0, x; = 0. The differentials 
dp;, dx, are linearly independent since their Poisson bracket is nonzero. In 
fact if X, is the vector field corresponding to x1, we have ip,ix, 2 = —1. Let 
€ be any vector field which induces a flow along (tangential to) X*. Such a 
vector cannot change the value of p;,21, so we have igdp; = 0, igdx; = 0. 
Equation (20.24) and this result show that we can write 


Q=2M + dp, Adz, ; (20.25) 


{2* does not involve differentials dp; or dr,. Evidently, dQ* = 0. Further the 
contraction with {2* of any vector tangential to X* is the same as its con- 
traction with (2. Thus 2* must be invertible for vectors tangential to 5*. 2* 
therefore defines a symplectic two-form on Y*, which is (2n — 2)-dimensional. 
The problem is reduced to the question of choosing Darboux coordinates on 
the lower dimensional space. We can now proceed in a similar manner, start- 
ing with X* and 2*, constructing another canonical pair po, r2, obtaining a 
reduction to a (2n—4)-dimensional subspace, and so on inductively, to prove 
the theorem. 
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20.2 Classical dynamics 


The time-evolution of any quantity is a particular canonical transformation 
generated by a function H called the Hamiltonian; this is the essence of the 
Hamiltonian formulation of dynamics. If F is any function on M, we then 


have 
OF 


er = {FH} (20.26) 


Specifically for the local coordinates on M we can write 


Og" a 


OH 
= Qh’ —— 20.2 

= (20.27) 
Since §2 is invertible, we can also write this equation as 
Oq’ OH 
Ot Oqe 
At this point we can relate this to an action and a variational principle. We 
define the action as 


Ge (20.28) 


ty dq" 
s= dt (A= — #) (20.29) 
. dt 
where g(t) gives a path on M. Under a general variation of the path g(t) > 
g#(t) + €4(t), the action changes by 


Ay de gy , 4, 46 _ OF oy 
— fe Fs fae as ae 


dq” 
ral a ih dt (2. vn - 5a) e (20.30) 


The variational principle says that the equations of motion are given by the 
extremization of the action, i.e., by 6S = 0, for resricted set of variations 
with the boundary data (initial and final end point data) fixed. From the 
above variation, we see that this gives the Hamiltonian equations of motion 
(20.28). There is a slight catch in this argument because q” are phase space 
coordinates and obey first-order equations of motion. So we can only specify 
the initial value of g“. However, the Darboux theorem tells us that one can 
choose coordinates on neighborhoods of M such that the canonical one-form 
A is of the form p,;dz’* for each neighborhood. Therefore, instead of specifying 
initial data for all g“, we can specify initial and final data for the r*’s. The 
€" in the boundary term is thus just dx’. Since the boundary values are 
kept fixed in the variational principle dS = 0, we may set éz* = 0 at both 
boundaries and the equations of motion are indeed just (20.28). 
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We have shown how to define the action if 2 is given. However, going 
back to the general variations, notice that the boundary term is just the 
canonical one-form contracted with €“. Thus if we start from the action as 
the given quantity, we can identify the canonical one-form and hence §2 from 
the boundary term which arises in a general variation. In fact 


, : dq’ "OH ja 
oS igA(t yf) _ ig A(t; ) 2 je (Ef _ oa e (20.31) 
We have used this equation in Chapter 3 to identify the canonical one-form 
and carry out the canonical quantization. 


20.3 Geometric quantization 


In the quantum theory, the algebra of observables is an operator algebra with 
a Hilbert space which provides an irreducible unitary representation of this 
algebra. The allowed transformations of variables are then unitary transfor- 
mations. There are thus two essential points to quantization: 1) a correspon- 
dence between canonical transformations and unitary transformations and 2) 
ensuring that the representation of unitary transformations on the Hilbert 
space is irreducible. Since functions on phase space generate canonical trans- 
formations and hermitian operators generate unitary transformations, we get 
a correspondence between functions on phase space and operators on the 
Hilbert space. The algebra of Poisson brackets will be replaced by the alge- 
bra of commutation rules. The irreducibility leads to the necessity of choosing 
a polarization for the wave functions. 

Before considering different aspects of the operator approach, we shall 
start with the notion of the wave function. In the geometric approach to the 
wave function, the first step is the so-called prequantum line bundle. This 
is a complex line bundle on the phase space with curvature ‘2. Sections of 
this line bundle form the prequantum Hilbert space. In more practical terms, 
this means that we consider complex functions ¥(q) on open neighbourhoods 
in M, with a suitably defined notion of covariant derivatives such that the 
commutator of two covariant derivatives gives 92. Since 2 is closed, at least 
locally we can write {2 = dA, where A is the symplectic potential. Under 
a canonical transformation, (2 does not change, but the symplectic potential 
transforms as A — A’ = A+ dA, as we have seen in (20.18). In other 
words, A undergoes a U(1) gauge transformation. The statement that W’s 
are sections of a line bundle means that locally they are complex functions 
which transform as 

VW’ =exp(iA) v (20.32) 


As mentioned before, one may think of A as a U(1) gauge potential; W’s are 
then like matter fields. The above equation is equivalent to the requirement 
of canonical transformations being implemented as unitary transformations. 
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The transition rules for the W’s from one patch on M to another are likewise 
given by exponentiating the transition function for A. The functions W’s so 
defined, which are also square-integrable, form the prequantum Hilbert space; 
the inner product is given by 


(1)2) = | do(M) W* Wy (20.33) 


where do(M) is the Liouville measure on the phase space, defined by 2. 

One can, at the prequantum level, introduce operators corresponding to 
various functions on the phase space. A function f(q) on the phase space 
generates a canonical transformation which leads to the change A = igA — f 
in the symplectic potential. The corresponding change in W is thus 


ow = +0, — i( igA— f)w 
= €4# (8, -—iA,) V+ ifw 
— Go epaee sy) 2 (20.34) 


where the first term gives the change in W considered as a function and the 
second term compensates for the change of A. In the last expression D,, are 
covariant derivatives 0, — 1%A,; these are the appropriate derivatives to 
consider in view of the U(1) gauge symmetry on the wave functions. The 
gauge potential to be used is indeed A, so that the curvature is 2. Based on 
(20.34), we define the prequantum operator corresponding to f(q) by 


P(f) =—i(€-D + if) 
=-if-D +f : (20.35) 


We have seen that if the Hamiltonian vector fields for f, g, are € and 7, 
respectively, then the vector field corresponding to the Poisson bracket {f, g} 
is —[€,7]. From the definition of the prequantum operator above, we then 
find 


[P(f), P(9)] = [-#€ -D + f,-in- D+ 9] 
ee [E" Dy, n’ Dy] ai las [D,, 9] + int [Du f] 
= i€PyY Quy — (E* Onn’) Dy + (nh OnE") Dy — 1" Ong + in? On F 
= i(—€4 1’ Quy + if, n] - D) 
= i (-# (if,,¢DP) + {f,9}) 
= iP({f,9}) (20.36) 


In other words, the prequantum operators form a representation of the Pois- 
son bracket algebra of functions on phase space. 

The prequantum wave functions WY depend on all phase space variables. 
The representation of the Poisson bracket algebra on such wave functions, 
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given by the prequantum operators, is reducible. A simple example is suffi- 
cient to illustrate this. Consider a point particle in one dimension, with the 
symplectic two-form 2 = dp A dx. We can choose A = pdx. The vector fields 
corresponding to x and p are €, = —0/Op and &, = 0/0z. The corresponding 
prequantum operators are 


Pe = ix seg 
6) 
= —j— 20.37 
Plp) = ~i5, (20.37) 
which obey the commutation rule 
[P(x), P(p)] =i (20.38) 


We clearly have a representation of the algebra of P(x), P(p) in terms of 
prequantum functions Y(z,p). But this is reducible. For if we consider the 
subset of functions on the phase space which are independent of p, namely, 
those which obey the condition 


= =O, (20.39) 


then the prequantum operators become 
Pick— 2 


nO 
P(p) = -i De (20.40) 
which obey the same algebra (20.38). Thus we are able to obtain a repre- 
sentation of the algebra of observables on the smaller space of W’s obeying 
(20.39), showing that the previous representation (20.37) is reducible. 

In order to obtain an irreducible representation, one has to impose sub- 
sidiary conditions which restrict the dependence of the prequantum wave 
functions to half the number of phase space variables. This is the choice of 
polarization and generally leads to an irreducible representation of the Pois- 
son algebra. For the implementation of this we need to choose a set of n 
vector fields P*, i = 1,2...n, so that 


Quy PPP? =0 (20.41) 
and impose the condition 
POD. = 0 (20.42) 


The vectors P!* define the polarization. The wave functions so restricted are 
the true wave functions. 

The next step is to define an inner product, and restrict to square- 
integrable functions, so that these wave functions form a Hilbert space. While 
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the volume element and the notion of inner product can be defined on the 
phase space in terms of 92, generally, there is no natural choice of inner prod- 
uct once we impose the restriction of polarization. However, there is one case 
where there is a natural inner product on the Hilbert space. This happens 
when the phase space is also Kahler and 2 is the Kahler form or some mul- 
tiple thereof. In this case we can introduce local complex coordinates and 
write 


Q = NQaadz® A dx® (20.43) 
a,@ =1,2...n. The covariant derivatives are of the form 
De = a, ae PA, 
Da = 05 — iAg (20.44) 


For a Kahler manifold, there is a Kahler potential K defined by 


a 


Aa = — 500K 


a 


5 eee 50% K (20.45) 


In this case, one can choose the holomorphic polarization 
1 
Dav = (05 + 30a )Y —<) (20.46) 


which gives 
W =exp(—5K) F (20.47) 


where F is a holomorphic function on M. The wave functions are thus holo- 
morphic, apart from the prefactor involving the Kahler potential. In this case 
the inner product of the prequantum Hilbert space can then be retained, up 
to a constant of proportionality, as the inner product of the Hilbert space; 
specifically 


(1/2) = / do(M) e-* F*F, (20.48) 


Almost all the cases of interest to us are of this type. 

Once the polarized wave functions are defined, the idea is to represent ob- 
servables as linear operators on the wave functions as given by the prequan- 
tum differential operators. Let € be the Hamiltonian vector field correspond- 
ing to a function f(g). If the commutator of € with any polarization vector 
field P; is proportional to P, itself, i.e., [€, P;] = C? P; for some functions C?, 
then, evidently, € does not change the polarization. €W obeys the same polar- 
ization condition as W. In this case the operator corresponding to f(q) is given 
by P(f). For operators which do not preserve the polarization, the situation 
is more involved. Since operators corresponding to observables are generators 
of unitary transformations, we must construct directly infinitesimal unitary 
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transformations whose classical limit is the required canonical transforma- 
tion and identify the quantum operator from the result. This will require, 
in general, a pairing between wave functions obeying different polarization 
conditions. 


20.4 Topological features of quantization 


We now turn to some of the topological features of phase space and their 
effects on quantization. As we have mentioned before two of the key topolog- 
ical problems have to do with the first and second cohomology of the phase 
space. 


1. The case of nontrivial 11(M, R) 


Consider first the case of H'(M, R) # 0. In this case for a given symplec- 
tic two-form 92, we can have different symplectic potentials. A and A+ A 
lead to the same 2 if A is closed, i.e., if dA = 0. If A is exact so that A = dh 
for some globally defined function h on M, then the function h is a canonical 
transformation, physical results are unchanged, and so this is equivalent to 
A = 0 upon carrying out a canonical transformation. If A is closed but not 
exact, which is to say that if it is a nontrivial element of the cohomology 
H'(M,R), then we cannot get rid of it by a canonical transformation. Lo- 
cally we can write A = df for some f, but f will not be globally defined on 
M. Classical dynamics, which is defined by 92 as in the equations of motion 
(20.27), will not be affected by this ambiguity in the choice of the symplectic 
potential. In the quantum theory such A’s do make a difference. We see this 
immediately in terms of the action. The action for a path C, parametrized 
as q’(t) from a point a to a point 6 is 


dq’ g 


The action depends on the path but the contribution from A is topological. 
If we change the path slightly from C to C’ with the end points fixed, we 
find, using Stokes’ theorem, 


fa-f A=$ el 
(O} u (G=CH 
= faa 

Py 

=0 


where C' — C” is the path where we go from a to b along C and back from b 
to a along C’. (Since we are coming back the orientation is reversed, hence, 
the minus sign.) Y is a surface in M with C — C’ as the boundary. The 


(20.50) 
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above result shows that the contribution from A is invariant under small 
changes of the path. (This is not true for the other terms in the action.) 
In particular, the value of the integral is zero for closed paths so long as 
they are contractible; for then we can make a sequence of small deformations 
of the path (which do not change the value) and eventually contract the 
path to zero. If there are.noncontractible loops, then there can be nontrivial 
contributions. If H!(M,R) # 0, then there are noncontractible loops. In the 


quantum theory, it is e** which is important, so we need il 4’ Assume for 
simplicity that H'(M,R) = Z so that there is only one topologically distinct 
noncontractible loop apart from multiple traversals of the same. Let A = 0a, 
where @ is a constant and a is normalized to unity along the noncontractible 
loop for going around once. For all paths which include n traversals of the 


loop, we find 
exp ( $ A) = exp (i# $ a) = exp (70n) (20.51) 


Notice that a shift 9 — @+ 27 does not change this value, so that we may 
restrict @ to be in the interval zero to 27. Putting this back into the action 
(20.49), we see that, considered as a function on the set of all paths, the action 
has an extra parameter 6. Thus the ambiguity in the choice of the symplectic 
potential due to H'(M, R) # 0 leads to an extra parameter 6 which is needed 
to characterize the quantum theory completely. If H!(M,R) is not just Z 
and there are more distinct paths possible, then there can be more such 
parameters. 

It is now easy to see these results in terms of wave functions. The relevant 
covariant derivatives are of the form D,Y = (0, —iA, —iA,)¥. We can write 


0) ate ( / : A) &(q) (20.52) 


where the lower limit of the integral is some fixed point a. By using this 
in the covariant derivative, we see that A is removed from D, in terms of 
action on @. This is like a canonical transformation, except that the relevant 
transformation exp (i [7 A) is not single valued. As we go around a closed 
noncontractible curve, it can give a phase e*’. Since W is single valued, this 
means that & must have a compensating phase factor; © is not single valued 
but must give a specific phase labeled by 6. Thus we can get rid of A from 
the covariant derivatives and hence the various operator formulae, but diag- 
onalizing the Hamiltonian on such @’s can give results which depend on the 
angle @. 

The 9-parameter in a nonabelian gauge theory is an example of this kind 
of topological feature. The description of anyons or particles of fractional 
statistics in two spatial dimensions is another example. 


2. The case of nontrivial H?(M, R) 
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We now turn to the second topological feature, the case of H?(M,R) 
# 0. In this case there are closed two-forms on M which are not exact. 
Correspondingly, there are closed two-surfaces which are not the boundaries 
of any three-dimensional region, i.e., noncontractible closed two-surfaces. In 
general, elements of H?(M,R) integrated over such noncontractible two- 
surfaces are not zero. If {2 is some nontrivial element of H?(M,R), then the 
symplectic potential cannot be globally defined. We see this easily as follows. 
Consider the integral of 2 over a noncontractible closed two-surface 2’, 


wien a 2 (20.53) 


If 5” is a small deformation of 5’, then 
(5) — 1(5") = | os / d2 =0 (20.54) 
pres 3ys V 


where V is a three-dimensional volume with the two surfaces 1 and ” as 
the boundary. Thus the integral of 2 is a topological invariant, invariant 
under small deformations of the surface on which it is integrated. If we can 
write (2 as dA for some A globally defined on then clearly [(2’) is zero 
by Stokes’ theorem. Thus if J(2’) is nonzero, then A cannot be globally de- 
fined on 4’. We have to use different functions to represent A in different 
coordinate patches, then have transition functions relating the A’s in overlap 
regions. Even though we may have different definitions of A in an overlap 
region corresponding to the different patches which are overlapping, (2 is the 
same, and so the transition functions must be canonical transformations. As 
an example, consider a closed noncontractible two-sphere, or any smooth de- 
formation of it, which is a subspace of M. We can cover it with two coordinate 
patches corresponding to the two hemispheres, denoted N and S as usual. 
The symplectic potential is represented by Ay and Ag, respectively. On the 
equatorial overlap region, they are connected by 


An =As+dA (20.55) 


where A is a function defined on the overlap region. It gives the canonical 
transformation between the two A’s. 

Since A is what is used in setting up the quantum theory and since, in 
particular, the canonical transformations are represented as unitary transfor- 
mations on the wave functions, we see that we must have a Wy for the patch 
N and a Wg for the patch S. On the equator they must be related by the 
canonical transformation, which from (20.32), is given as 


Wy =exp(iA) Wg : (20.56) 


We now consider the integral of dA over the equator E, which is a closed 
curve being the boundary of either N or S. From (20.55) this is given as 
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aa=$ d= f Ay f As 
E E E 
=| An + As 
ON os 
-fa+ fo 
N st 
=| 2 (20.57) 
ay 


In the second step, we reverse the sign for the S-term because E considered as 
the boundary of S has the opposite orientation compared to itself considered 
as the boundary of N. The above equation shows that the change of A as 
we go around the equator once, namely, AA, is nonzero if [(2’) is nonzero; 
A is not single valued on the equator. But the wave function must be single 
valued. From (20.56), we see that this can be achieved if exp(iA4A) = 1 or 
if AA = 2rn for some integer n. Combining with (20.57), we can state that 
single-valuedness of wave functions in the quantum theory requires that 


i, Q = 2xn (20.58) 
D3 


The integral of the symplectic two-form on closed noncontractible two- 
surfaces must be quantized as 27 times an integer. We have given the ar- 
gument for surfaces which are deformations of a two-sphere, but a similar 
argument can be made for noncontractible two-surfaces of different topology 
as well. The result (20.58) is quite general. 

The typical example of this kind of topological feature is the motion of a 
charged particle in the field of a magnetic monopole. The condition (20.58) 
is then the Dirac quantization condition. The Wess-Zumino terms occuring 
in many field theories are another example. 


20.5 A brief summary of quantization 


In summary, the key features of the quantization of a system using the holo- 
morphic polarization are the following: 

1) We need a phase space which is also Kahler; the symplectic two-form being 
a multiple of the Kahler form. 

2) The polarization condition is chosen as Dg VY = 0. 

3) The inner product of the prequantum Hilbert space, which is essentially 
square integrability on the phase space, is retained as the inner product on 
the true Hilbert space in the holomorphic polarization. 

4) The operator corresponding to an observable f(q) which preserves the 
chosen polarization is given by the prequantum operator P(f) acting on the 
true (polarized) wave functions. 
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5) For observables which do not preserve the polarization, one has to con- 
struct infinitesimal unitary transformations whose classical limits are the re- 
quired canonical transformations. 

6) If the phase space M has noncontractible two-surfaces, then the integral 
of 92 over any of these surfaces must be quantized in units of 27. 

7) If H'(M,R) is not zero, then there are inequivalent A’s for the same 2 
and we need extra parameters to specify the quantum theory completely. 


20.6 Examples 


20.6.1 Coherent states 


For a one-dimensional quantum system, 92 = dp A dz = idz A dz, where 
(p + iz) //2 = z,Z. Choose 


A= s(z dz —z dz) ; (20.59) 


The covariant derivatives are 0, — 52 and 0; + 4Z. Holomorphic polarization 
corresponds to P = 0/02, leading to the condition 


(0; + $z)W =0 . (20.60) 
This is equation (20.46) for this example. The solutions are of the form 
see 
Ww =e 27* v(z) (20.61) 


where y(z) is holomorphic in z. 
The vector fields corresponding to z, Z are 


owt as 
Zo Fit 20.62 
ay (20.62) 


These commute with P = 0/02 and so are polarization-preserving. The pre- 
quantum operators corresponding to these are 


Pe) =-i(-i) (F the) +e--Z be 
P(z) = —i(%) ae +Z= ok (20.63) 
pe Oz ? 


In terms of their action on the functions y(z) in (20.61) corresponding to W’s 
obeying the polarization condition, these can be written as 
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Plz) vlz) =z (2) 


Plz) v(z) = (20.64) 


The inner product for the y(z)’s is 


dzndz., 
(ja) = fi aS Ws 


G2 ey 
= i pO et gs (20.65) 


What we have obtained is the standard coherent state (or Bargmann) real- 
ization of the Heisenberg algebra. 


20.6.2 Quantizing the two-sphere 


We consider the phase space to be a two-sphere S? ~ CP? considered as 
a Kahler manifold. Using complex coordinates z = x + iy,Z = x — iy, the 
standard Kahler form is 


dz \ dz 
The metric is given by ds? = ele! + e?e” where the frame fields are 
dx dy 
ee oe 
aa C aieiay (20.67) 
where r? = zz. The Riemannian curvature is R!? = 4e! Ae?, giving the Euler 
number R 
12 
as —— 20. 
x — (20.68) 


The phase space has nonzero H?(M) given by the Kahler form. As we have 
discussed, the symplectic two-form must belong to an integral cohomology 
class of M to be able to quantize properly. We take 

dz \ dz 
i = 20.69 
on (14+ 22)? ( ) 
where n is an integer. In this case, if mu £2 = 277 as required by the quantiza- 
tion condition. The symplectic potential can be taken as 


in ? dz—z =] 


ike. a (20.70) 


The covariant derivatives are given by 0 —iA. The holomorphic polarization 
condition is 


: nr z 
(Oz a iAz)V = 2. Se Sots | ve=() (20.71) 
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This can be solved as 
Ww = exp (-5 log(1 + z2)) f(z) (20.72) 


Notice that nlog(1 + 2Z) is the Kahler potential for 2. The inner product is 


given by — 
: INU GA ue OR SSG 
(1]2) mia | Saag iey2 (20.73) 


Here a is an overall constant, which can be absorbed into the normalization 
factors for the wave functions. Since f(z) in (20.72) is holomorphic, we can see 
that a basis of nonsingular wave functions is given by f(z) = 1, z, 2?,---, 2"5 
higher powers of z will not have finite norm. The dimension of the Hilbert 
space is thus (7+ 1). We could have seen that this dimension would be finite 
from the semiclassical estimate of the number of states as the phase volume. 
Since the phase volume for M = S? is finite, the dimension of the Hilbert 
space should be finite. 

It is interesting to see this dimension in another way. The polarization 
condition (20.71) is giving the 0-closure of Y with a U(1) gauge field A and 
curvature R12. The number of normalizable solutions to (20.71) is thus given 
by the index theorem for the twisted Dolbeault complex, i.e., 


nde (Oe I, td(M) A Ch(V) (20.74) 


where, for our two-dimensional case, the Todd class td(M) is R/4m and the 
Chern character Ch(V) = Tr(e‘F/2™) = Tr(e®/2) is f 2/27 for us. We thus 
have 


= 2 R 
index(Oy) = i = ae , = 
=nt+1 - (20.75) 


Notice that, semiclassically, we should expect the number of states to be 
f{ Q/2x =n. The extra one comes from the Euler number in this case. 
An orthonormal basis for the wave functions may be taken to be 


1 
a nl ee 
fx(z) = E Fe ai a (20.76) 
with the inner product, for two such functions f, g, 
: dz \ dz a 


Notice that this is the same as (20.73) but with a specific choice of a = n+1. 
This is the value which gives Trl = n+1 as expected for an (n+1)-dimensional 
Hilbert space. 
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Classically the Poisson bracket of two functions F and G on the phase 
space is given by 


{F,G} = 2"8,F8,G 


= “(1 + 22)? (55 2 eo) (20.78) 
Consider now the vector fields 
co (Bo 
(= 3 («5 _ 75 | (20.79) 


It is easily verified that these are the standard SU (2) isometries of the sphere. 
The Lie brackets of the €’s give the SU(2) algebra, up to certain factors of 
+i compared to the standard form, having to do with how we have defined 
the €’s; we have put in certain multiplicative constants so that the quantum 
algebra has the standard form. Further, the €’s are Hamiltonian vector fields 
corresponding to the functions 


Z 
os 1+ 2z 
ve 
J 1+ 2z 
n {1—2zz 
=. (|e 20.80 
Js 2, (=) ( ) 


The prequantum operators —ifé -D+ J corresponding to these functions are 


= (fopoepeee), _ 40; 
P(J,) = (< Oz et | s£i Ds 


n z Pes 
P(J_) = (-2. aS =) = 4? D; 
P(J3) = (<a : = ;) — itiD; (20.81) 


Acting on the polarized wave functions, Dz gives zero. Writing Y as in (20.72), 
we can work out the action of the operators on the holomorphic wave func- 
tions f(z). We get 

Te = 270,-—nz 

te, 

Jz = 20,-5n (20.82) 
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If we define 7 = n/2, which is therefore half-integral, we see that the operators 
given above correspond to a unitary irreducible representation of SU(2) with 
J? = j(j7 +1) and dimension n + 1 = 27 + 1. Notice that there is only one 
representation here and it is fixed by the choice of the symplectic form 92. 

From the symplectic potential (20.70) and from (20.29) we see that an 
action which leads to the above results is - 


n zz— Zz 
= " 20.83 
: is fae 1+ 2z ( ) 


where the overdot denotes differentiation with respect to time. This action 
may be written as 


Sait / @ Trias ae (20.84) 


where g is an element of SU(2) written as a (2 x 2)-matrix, g = exp(io;6,;) and 
o;,1 = 1, 2,3, are the Pauli matrices. In this action, the dynamical variable is 
an element of SU(2). If we make a transformation g — g h, h = exp(io3y), 
we get 


sss—n fay (20.85) 


The extra term is a boundary term and does not affect the equations of 
motion. Thus classically the dynamics is actually restricted to SU(2)/U(1) = 
S*, The choice of parametrization 


il j er 7 e~ 0 
sean (3, 2) [% 0%] ans 
leads to the expression (20.83). 

Even though the classical dynamics is restricted to SU(2)/U(1), the 
boundary term in (20.85) does have an effect in the quantum theory. Con- 
sider choosing y(t) such that y(—oo) = 0 and y(oo) = 2z. In this case 
h(—oo) = h(oo) = 1 giving a closed loop in the U(1) subgroup of SU(2) 
defined by the o3-direction. For this choice of h(t), the action changes by 
—2rn. e** remains single valued and, even in the quantum theory, the extra 
U(1) degree of freedom is consistently removed. If the coefficient were not an 
integer, this would not be the case and we would have inconsistencies in the 
quantum theory. Thus the quantization of the coefficient to an integral value 
is obtained again, from a slightly different point of view. 

So far we have used a local parametrization of S? which corresponds to 
a stereographic projection of the sphere onto a plane. Another more global 
approach is to use the homogeneous coordinates of the sphere viewed as CP?. 
We use a two-component spinor u,, a = 1, 2, with the identification ug ~ Aue 


for any nonzero complex number 4. We also define % = u35, U2 = —u* or 
a ExBUgs where €ag = —€Ga, €12 = 1. The symplectic form is 
du-du wt-duu-du 
2 = —-in - : 
| ai-u (a-u)? | i) 
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where the notation is u-v = ugvgeqg. It is easily checked that Q(Au) = Q(u); 
it is invariant under u — Au and hence is properly defined on CP! rather than 
C? — {0}. The choice of ug/u; = z leads to the previous local parametrization. 


The symplectic potential is 


niu du — du-t 
A=-i® Saas = | (20.88) 
Directly from the above expression we see that 
7 = 
A(\u) = A(u) — d (5 log(X/2)) (20.89) 


This means that A cannot be written as a globally defined one-form on CP?. 
This is to be expected because { §2 4 0 and hence we cannot have a globally 
defined potential on CP’. From (20.32), we see that the prequantum wave 
functions must transform as 


W(Au, Mi) = U(u, a) exp E log(A/2)| (20.90) 


The polarization condition for the wave functions becomes 


O = NnUBEBa 
ae = 20.91 


The solution to this condition is 
W = exp (-5 log(u - u)) f(u) (20.92) 


Combining this with (20.90), we see that the holomorphic functions f(u) 
should behave as 


f(Au) =A” f(u) (20.93) 
f(u) must thus have n u’s and hence is of the form 
[Ca tet. (20.94) 


Because of the symmetry of the indices, there are n+1 independent functions, 
as before. There is a natural linear action of SU(2) on the u, u given by 


where Ug form a (2 x 2) SU(2) matrix. The corresponding generators are 
the J, we have constructed in (20.81, 20.82). 
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20.6.3 Compact Kahler spaces of the G/H-type 


The two-sphere S? = SU(2)/U(1) is an example of a group coset which is a 
Kahler manifold. There are many compact Kahler manifolds which are of the 
form G/H, where H is a subgroup of the compact Lie group G. In particular 
G/H is a Kahler manifold for any compact Lie group if H is its maximal torus. 
The maximal torus is the subspace of G generated by the mutually commuting 
generators. Another set of spaces is given by CP’ = SU(N+1)/U(N). There 
are many other cases as well. 

One can take the symplectic form as proportional to the Kahler form or 
as a combination of the generators of H?(M) and quantize these spaces as we 
have done for the case of S*. In general, they lead to one unitary irreducible 
representation of the group G, the particular choice of the representation 
being determined by the choice of 22. 

In most of these cases, the Kahler form can be constructed in a very 
simple way. As an example consider CP? = SU(3)/U(2). A general element 
of SU(3) can be represented as a unitary (3 x 3)-matrix. We define a U(1) 
subgroup by elements of the form U = exp(iJ*@), where 


1 {i 9 0 


P=-—|0 1 0 - (20.96) 
V6\9 MH 2 


We also define an SU(2) subgroup which commutes with this by elements of 
the form 


h 0 
Asu(2) = ( ae :) (20.97) 


These two together form the U(2) subgroup of SU(3). Consider now the 
one-form 


A(g) = iw Tr(I®g—'dg) (20.98) 


where g is an element of the group SU(3) and w is a numerical constant. If 
h is an element of U(2) Cc SU(3), we find 


A(gh) = A(g) — wd (20.99) 


We see that A changes by a total differential under the U(2)-transformations. 
Thus dA is defined on CP?. Evidently it is closed, but it is not exact since 
the corresponding one-form is not globally defined on CP?, but only on 
G = SU(3). Thus dA is a nontrivial element of H?(CP”). We can use dA as 
the symplectic two-form. The integrals of dA over nontrivial two-cycles on 
CP? will have to be integers; this will restrict the choices for w. Alternatively, 
we take the action to be S = f A; for e*° to be well defined on CP”, we will 
have restrictions on the w. The wave functions are functions on SU (3) subject 
to the restrictions given by the action of SU(2) and U(1). In other words, we 
can write, using the Wigner D-functions 


20.6 Examples 507 


UW ~ Do (9) (20.100) 


Here a is a set of indices which labels the representation; A, B label the states. 
“B(g) is the AB-matrix element of the group element g in the irreducible 
representation of G characterized by the labels a. 
The groups involved in the quotient can be taken as the right action on 
g. The transformation law for A then tells us that Y must transform as 


W(gh) = exp(iw6) Y(g) (20.101) 


This shows that the wave functions must be singlets under the SU(2) sub- 
group, acting on the right of g, and carry a definite charge w under the U(1) 
subgroup. This restricts the choice of values for the index B in (20.100). Fur- 
ther w must also be quantized so that it can be one of the allowed values in the 
unitary irreducible representations of SU(3) in (20.100). Once the indices B 
are chosen this way, the index A is free and so the result of the quantization is 
to yield a Hilbert space which is one unitary irreducible representation of the 
group SU(3). w is related to the highest weights defining this representation. 
(There are many representations satisfying the condition on the charges for 
the subgroup. But the polarization condition, which we have not discussed 
for this problem, will impose further restrictions and choose one representa- 
tion which is a “minimal” one among the various representations allowed by 
the charges.) To do this in more detail, notice that [® = \/3/2 Y, where Y is 
the hypercharge; thus D4p(gU) = Dap exp(i,/3/2 Yp0), Ye being the hy- 
percharge of the state B, identifying w as ,/3/2Yz. Unitary representations 
of SU(3) have hypercharge values quantized in units of 33 SU (2) invariant 
states have hypercharge quantized in units of 2. Equation (20.101) then tells 
us that w must be quantized as n,/2/3, where n is an integer. 

An explicit parametrization can be obtained as follows. Evaluating the 
trace in (20.98), we get 


A= ~iwy/? Un dua (20.102) 
where 993 = Ua. Evidently, uzue = 1 and we can parametrize it as 
1 1 
Lo = aos 21 (20.103) 
1+2Z-z zo 


The symplectic two-form corresponding to the potential (20.102) is 


(f= ~iw/? dus dug 


She fence Rebar beat dams 7edz 
aes; sy || easton inal Seal 20.104 
wwy/3 | ees (l+2-z) ( ) 
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This 92 is proportional to the Kahler two-form on CP”. Requiring that Q 
should integrate to an integer over closed nontrivial two-surfaces in CP? will 
lead to the same quantization condition on w. This is similar to the case of 
the two-sphere or CP!. The polarization condition will tell us that the states 
are functions of u,’s only, not u*’s, and the condition on the U(1)-charge 
will fix the number of u.’s to be n. The states are thus of the form 


WU ~ Ua, Ua. *** Ua (20.105) 


nT 


This corresponds to the rank n symmetric representation of SU(3). One can 
get other representations by other choices of H. 
More generally one can take 


Algo) =a De WaTr(t*g~ dg) (20.106) 


where ¢* are diagonal elements of the Lie algebra of G and wa, are a set 
of numbers. H will be the subgroup commuting with eh Wat"; if we are 
such that all the diagonal elements of >, wat® are distinct, then H will be 
the maximal torus of G. A will change by a total differential under g — 
gh, h € H and dA will be a closed nonexact form on G/H. If some of 
the eigenvalues of )>, wat® are equal, H can be larger than the maximal 
torus. Upon quantization, for suitably chosen w,, we will get one unitary 
irreducible representation of G, and w, will be related to the weights defining 
the representation. 


20.6.4 Charged particle in a monopole field 
The symplectic form for a point particle in three dimensions is 
2 = dpyids, _ (20.107) 


The usual prescription for introducing coupling to a magnetic field involves 
replacing p; by p; + eA;. The Lagrangian has a A;%; added to it, giving 
the canonical one-form mz;dz; + eA;dz;. Writing pj = mz;, the symplectic 
two-form is found to be 

2 = dp; \dx; +eF (20.108) 


where F = dA = 4F,;dz* A dz’ is the magnetic field strength. Thus in 
terms of the symplectic two-form the minimal prescription for introducing 
electromagnetic interactions amounts to adding eF to 2. We can use this 
to discuss a charged particle in the field of a magnetic monopole which has 
a radial magnetic field B, = gx;/r°®, where g is the magnetic charge of the 
monopole and r? = 2;x;. The monopole is taken to be at the origin of the 
coordinate system. Thus 2 is 


1 a0 
2 = dp; \dx;, + 5 eMeish —y Ors arse (20.109) 
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We can identify the basic Hamiltonian vector fields. Contraction of X; = 
—0/Op; with 92 gives —dx; identifying it as the vector field for x*. The con- 
traction of P! = 0/02; gives 


ip-Q = —dp; + egeijn = de; (20.110) 


P’ is not a Hamiltonian vector field, but since we get dz; from contraction 
of 0/Op; with 2 we see that the combination 


Lk O 
ae egEigk 3 a (20.111) 
is a Hamiltonian vector field and corresponds to p;. The Poisson brackets are 
thus 


{tees = —1x, 02; == (() 
{2,73} = —ix,dp; = 63; 
; x 
{pi, Pj} = —ip,dp; = e9esjk = (20.112) 


It is also interesting to work out the angular momentum. Under an in- 
finitesimal rotation by angle 6; the change in the variables x;, p; are 


62; = =k agen 
Opi = —€ijn Pj Or (20.113) 


The corresponding vector field is therefore 
O 0 
€ = —€1jk Pe (sig pe ioe =| (20.114) 


Upon taking the interior contraction of this with 2, we find 


zie 


ig as —€45n OK (—2;dp; = pede; ) = O.eg (OO, — He ——dz™ 
dx _ UE : 
= 0,d(—€:;~2ip;) + Oxeg (= : 7 ar‘) 
k 
= 0.0 (—cssnin; = eg) (20.115) 


The angular momentum which is the generator of rotations is thus 
es 
Ji = €ijk2jPk — eg (20.116) 


This shows that the charged particle has an extra contribution to the angular 
momentum which is radial. In fact £- J = —eg. 

By converting the Poisson brackets to commutators of operators we can 
set up the quantum theory. The only unusual ingredient is the following. The 
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symplectic two-form is singular at r = 0. Thus we need to remove this point 
for a nonsingular description. If we do so, then the space has noncontractible 
two-spheres and so there is a quantization condition for 2. Integrating 2 
over a two-sphere around the origin (which is the location of the monopole), 
we get 47eg. The general quantization condition (20.58) becomes 

eg= > 7 (20.117) 
This is the famous Dirac quantization condition stating that the magnetic 
charge must be quantized in unjts of 1/2e. This value 1/2e is the lowest mag- 
netic charge corresponding to one monopole. The argument for quantization 
also follows from noting that in the quantum theory, the eigenvalues of any 
component of angular momentum have to be half-integral. z- J = —eg then 
leads to the same quantization; this was first noted by Saha. 


20.6.5 Anyons or particles of fractional spin 


We consider relativistic particles in two spatial dimensions. In general, they 
can have arbitrary spin, not necessarily quantized, and hence they are gener- 
ically referred to as anyons (or particles of any spin). 

We will work out some of the theory of anyons starting with a symplec- 
tic structure. A spinless particle may be described by a set of momentum 
variables p* and position variables x°, a = 0,1,2. The canonical structure or 
symplectic two-form is given by 


2 = gap dx* A dp? (20.118) 


where ga, = diag(1, —1, —1) is the metric tensor. For a charged particle, as 
discussed above, the coupling to the electromagnetic field A, by the minimal 
prescription is equivalent to 2 — 2+ eF. 

The motion of the relativistic charged particle is given by the (classical) 
Lorentz equations 


pe 

m dt 
d a 
== = -— Fp, (20.119) 


7 is the parameter for the trajectory of the particle (with mass m). We have 
chosen a specific parametrization or equivalently a gauge-fixing for the gauge 
freedom of reparametrizations of the trajectory and so the equations (20.119) 
are not invariant under reparametrizations. Equations (20.119) tell us that 
the infinitesimal change of 7 is given, on the phase space, by a vector field 


_ pe ) pea ab 0 
~ m Ox ie Poona 


(20.120) 


20.6 Examples 511 


The canonical generator of the r-evolution, say, G, is defined by iy 2 = —dG. 
This gives G = —p?/(2m) + constant. Anticipating the eventual value of the 
constant, we choose it to be m/2. Basically this is the definition of the mass. 
Thus 


ae ae 
G=— 5 (p* —m’) (20.121) 


Since we need reparametrization invariance, the r-evolution must be triv- 
ial. Thus we must set Go to be zero for the classical trajectories. Quantum 
theoretically, this can be implemented by 


GV =0 (20.122) 


This will be the basic dynamical equation of the quantum theory. 
The symplectic form may be written as 


2 = 5Oan dA nag? 


0 Jar 
Qap= o 20.12 
_ (., 20) ee) 


where 4 = (p*, x) denotes both sets of phase space variables. This sym- 
plectic form leads to the commutation rules 


[a*, 2°] =0 
[p, a?| = ig?” 
[p*, p'] = ieFr™ (20.124) 


These relations are solved by 
Pa = 10, + eAg (20.125) 
The condition of trivial r-evolution, namely, (20.122), becomes 
[(O. — ieAq)? + m?|W = 0 (20.126) 


This is the Schrédinger equation (in this case, the Klein-Gordon equation) 
which describes the quantum dynamics of the particle. One can easily show 
that the Lorentz equations (20.119) are quantum mechanically realized by 


ie = [¢4,G] (20.127) 


In a more general situation, one can obtain the Schrodinger-type equation 
as follows. We start with the symplectic two-form. From the equations of 
motion, we find the generator of the r-evolution. Setting this generator to 
zero on the wave functions gives us the equation we are seeking. To realize 
this as a differential equation we must solve the commutation rules in terms 
of a set of coordinates and their derivatives (namely, canonical variables). 
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For a free anyon with spin —s, the symplectic structure is given by 


1 dp? A dp° 
2 = dx* \ dpa t= 8 cabo HSE (20.128) 
The commutation rules are given by 
(a, a?] = is «rhe ( 7 73 
[p*, x? = ig? 
[p?, p’] =0 (20.129) 


Consider the Lorentz generator J° defined by 


be p"| = ier, 
(J? oleic x, (20.130) 


It is easy to see that J® is given by 


a 


VP? 


This shows that the particle has a spin —s, easily seen in the rest frame with 
p° =m, p' = p* =0. The expression for J* is analogous to (20.116) except 
for the different signature for spacetime. Because of this change of signature, 
there is no quantization of the coefficient of the second term in the symplectic 
structure (20.128). The value s is not quantized. Alternatively, there is no 
closed two surface which is not the boundary of a three-volume. We can solve 
the commutation rules (20.129) in terms of canonical variables as 


¢ = q? +a°(p) 


a _ ,,-abe Po"c 
a°(p) = se Pe oir (20.132) 


where'7* = (1,0,0) and [q*,¢°] = 0," [p*)a")=90) [pq | —2genen ge 


jo 


J? = —e* an, — 8 (20.131) 


Pa 
Using (20.132) for z* in (20.131), we can write 


J = abc 0 = pe ae V p?n? 
= “1 ee 
Op Vp+p-n 


We see that p- J + sy/p? = 0. With p? = m?, we see that the spin is indeed 
—s. Thus the symplectic structure 2 of (20.128) is indeed appropriate to 
describe the anyon. 

The symplectic structure 92 for anyons in an electromagnetic field is now 
obtained by 2 — 2+ eF. Thus 


(20.133) 
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p° dp? A dp° 


Varn dpa + 9 S€abe (p2)3/2 


+ 5eFandx* Adz® + O(OF) (20.134) 
With the introduction of spin it is possible that Q has further corrections 
that depend on the gradients of the field strength F. This is indicated by 
O(OF) in the above equation. We shall not discuss the case of anyons in an 
electromagnetic field in any more detail here. One can actually obtain a wave 
equation as indicated and show that the gyromagnetic ratio for anyons is 2. 


20.6.6 Field quantization, equal-time, and light-cone 


We consider a real scalar field y with the action 


Op Op Op Ip 
4 _ 
aS [ae AR Ax° x9 xt Ox’ Uy) 035) 


where U(y) = gmp? +V(y). By considering a general variation and iden- 
tifying the boundary term at the initial and final time-slices, we find, using 
(20:31), 


oe i Br ~ dy (20.136) 


where we use 6 to denote exterior derivatives on the field space and ¢ = 
Oy/Ax°. The time-derivative of ~ must be treated as an independent variable 
since A is at a fixed time. By taking another variation we find the symplectic 
two-form 


Q= ‘ Pr do \ dy (20.137) 


Recalling that the Poisson brackets for the coordinates on the phase space 
are the inverse of the symplectic two-form as a matrix, we find 


ie@a sess) S0 
{ p(x, 2°), p(x’, 2°) } = 6°) (x — 2’) 
{pana \yalea ) } == 0 (20.138) 


Upon replacing the variables by operators with commutation rules given by 
i times the Poisson brackets, we get the standard equal-time rules for quan- 
tization. 

This phase space also has a standard Kahler structure. Consider the fields 
to be confined to a cubical box with each side of length L and volume V. 
With periodic boundary conditions, we can write a set of mode functions as 


UR (eo) = a exp(—ik - a) 
oo (20.139) 


L 
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Here n; are integers. The fields can be expanded in modes as 


g(x) = )) aura) 
k 


P(t) = D7 peur (x) (20.140) 


Se ee 


The reality of the fields requires gj = g_x, py = p-x- Substituting the mode 
expansion and simplifying, (2 becomes 


2 = >> dpe \ 5q-+ 
k 


=i J da, A bax (20.141) 
k 
where we define 
a, = La ipr) oe +ip_p) (20.142) 
Ke V2 q—k Pk) k V2 Qk ‘D—k . 


This shows the Kahler structure and we can carry out the holomorphic quan- 
tization as in the case of coherent states. 

A somewhat more interesting example which illustrates the use of the 
symplectic structure w;;(x, x’) is the light-cone quantization of a scalar field. 
We introduce light-cone coordinates, corresponding to a light-cone in the 
z-direction as 


u= ate +t) 
it 
=e- t) (20.143) 


Instead of considering evolution of the fields in time t, we can consider evo- 
lution in one of the the light-cone coordinates, say, u. The other light-cone 
coordinate v and the two coordinates z7 = x, y transverse to the light-cone 
parametrize the equal-u hypersurfaces. Field configurations y(u,v,z,y) at 
fixed values of u, i.e., real-valued functions of v,z, y, characterize the trajec- 
tories. They form the phase space of the theory. The action can be written 
as 


S= je dv dat [-O.~d,¢ — $(Ary)? — U(¢)] (20.144) 


Again from the variation of the action S, we can identify the canonical 
one-form A as 


A= / di x? (ae) (20.145) 


(This was denoted by © in Chapter 3.) The symplectic two-form is given by 
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1 
Q= f dua Qv, 27, v',x'7) 5p(v, 27) A dy(v', 2’) 
Q(v, 27, v', aT) = —2 d,5(v — v')S) (a? — g'T) 
dp = dv d?zT (20.146) 


From this point on, the calculation is identical to what we did in Chapter 3. 


The fundamental Poisson bracket can be written down from the inverse to 
2. Writing 


oy a 
d(v —v') 62) (aT —2'T) = ae ea ipa(v—v')—ip? (2? —2'T) (20.147) 


we see that 


a Bp i 


exp(—ipy(v — v') — ip? - (a7 — 2'7)) 


~ 2) (2n)3 ip 
= e(v —v') 6) (a? — 2'T) (20.148) 


Here e(v — v’) is the signature function, equal to 1 for v— v'’ > 0 and equal 
to —1 for v—v' <0. 

The phase space is thus given by field configurations y(v,z7) with the 
Poisson brackets 


{o(u0,27),o(uy0',a'7)} = Fe(v—v') 6(e? 27) (20.149) 


The Hamiltonian for u-evolution is given by 
i 
H= [ de d27 | 5(@re)? ae u(e)| (20.150) 


The Hamiltonian equations of motion are easily checked using the Poisson 
brackets (20.149). 

Quantization is achieved by taking y to be an operator with commutation 
rules given by i-times the Poisson bracket. 


20.6.7 The Chern-Simons theory in 2+1 dimensions 


The Chern-Simons (CS) theory is a gauge theory in two space (and one time) 
dimensions. The action is given by 


io Tr [aaa 2 549 
4n =x [ti,t;] 3 
k 


2 Pa ? Tr |4,0-4o + A.A, Aa| (20.151) 
An =x ([ti,tz] 3 
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Here A,, is the Lie-algebra-valued gauge potential, A, = —it*A/. ¢® are 
hermitian matrices forming a basis of the Lie algebra in the fundamental 
representation of the gauge group. We shall consider the gauge group to be 
SU(N) in what follows, and normalize the t¢ as Tr(t?t’) = 56%. kisa 
constant whose precise value we do not need to specify at this stage. We 
shall consider the spatial manifold to be some Riemann surface »' and we 
shall be using complex coordinates. The equations of motion for the theory 


are 
F,, =0 (20.152) 


The theory is best analyzed, for our purposes, in the gauge where Ao 
is set to zero. In this gauge, the equations of motion (20.152) tell us that 
eh (Ai +%Ag) and Az = $(Ai — 7A 2) are independent of time, but must 
satisfy the constraint 


F3, = Oz A, a O,Az se [Az, Az] a) (20.153) 


This constraint is just the Gauss law of the CS gauge theory. 
In the Ap = 0 gauge, the action becomes 


Se - / did (Auta), ae (20.154) 
For the boundary term from the variation of the action we get 
k ty 
5S = —— | Tr(AzdA,) (20.155) 
TJs iG 


We can now identify the symplectic potential as 


A=- = Tr(AzdA,) + dp[A] (20.156) 
D3, 
where p[A] is an arbitrary functional of A. The freedom of adding dp is the 
freedom of canonical transformations. As in the case of the scalar field, 6 is 
to be interpreted as denoting exterior differentiation on A, the space of gauge 
potentials on 2’. A is also the phase space of the theory before reduction by 
the action of gauge symmetries. (The space of gaueg potentials was denoted 
by A in Chapter 16; here we use A to avoid confusion with the symplectic 
potential.) 
The symplectic two-form 2 is given by 6A, i.e., 


ik k 
2 =S- - a = 
= [ Tr(5Az6A.) = -— [ Tr(5A A 6A) 


= tk ax AG 
= i dAL5 AS (20.157) 
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(We will not write the wedge sign for exterior products on the field space 
from now on since it is clear from the context.) 

The complex structure on induces a complex structure on Arsabuggmaies 
can be taken as the local complex coordinates on A. Indeed we have a Kahler 
structure on A; 92 is k times the Kahler form on A and we can associate a 
Kahler potential K with, given by 


= Fa i At Aa (20.158) 
Py; 


Poisson brackets for Az , A, are obtained by inverting the components of 2 
and read 


{A(z), A?,(w)} =0 
{A§(z), A, (w)} =0 
{A2(z), Ae, (w)} = — AEE gor) Ca (20.159) 


These become commutation rules upon quantization. 
Gauge transformations are given by 


A9 = gAg™' — dgg™ (20.160) 


Infinitesimal gauge transformations are generated by the vector field 


oa |. 0) i + (D:6)*; nal (20.161) 


where D, and D; denote the corresponding gauge covariant derivatives. By 
contracting this with 2 we get 


ik 
} = —d Poe" 20.162 
ga = -3/% | rae] (20.162) 
which shows that the generator of infinitesimal gauge transformations is 
G= =e (20.163) 


Reduction of the phase space can thus be performed by setting F' to zero. 
This is also the equation of motion we found for the component Ag. Notice 
also that 


(AI) — A(A)=6 E if Tr(g~ 16g a) (20.164) 


(In the second term F is the two-form dA + AA A.) 
The reduced set of field configurations are elements of A/ G, where G, 
denotes the group of gauge transformations, G, = {g(z) : © — G}. 
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The construction of the wave functionals proceeds as follows. One has to 
consider a line bundle on the phase space with curvature §2. Sections of this 
bundle give the prequantum Hilbert space. In other words, we consider func- 
tionals [A,, Az| with the condition that under the canonical transformation 
AA + 6A, &— e4). The inner product on the prequantum Hilbert 
space is given by 


ra 


(112) = ff dys, Az) @j[As, As] OalAe, Ad (20.165) 


where du(A,, Az) is the Liouville measure associated with 92. Given the 
Kahler structure, this is just the volume [dA,dAz] associated with the metric 
\OAl|? = fy JAZ5Az. 

The wave functionals so constructed depend on all phase space variables. 
We must now choose the polarization conditions on the #’s so that they 
depend only on half the number of phase space variables. This reduction 
of the prequantum Hilbert space leads to the Hilbert space of the quantum 
theory. Given the Kahler structure of the phase space, the most appropriate 
choice is the Bargmann polarization which can be implemented as follows. 
With a specific choice of p[A] in (20.156), the symplectic potential can be 
taken as 


ik ik i ee ere 
Ae P Tr(Az6A, — A,6Az) = es (Age As — ASOAZ) (20.166) 
The covariant derivatives with A as the potential are 
ye) ks =_,9 Kaa 
V (SAa + 7A?) V (572 ee) (20.167) 
The Bargmann polarization condition is 
V G=0 (20.168) 
or 
Kf jaya a aK gaa 
e—. “exp aor AZAL | [Ag] = e 2" eg) (20.169) 


where K is the Kahler potential of (20.158). The states are represented by 
wave functionals 7[A$%] which are holomorphic in A%. Further, the prequan- 
tum inner product can be retained as the inner product of the Hilbert space. 
Rewriting (20.165) using (20.169), we get the inner product as 


(1/2) = / [dAg, AZ] ec MAEAD) opt abo (20.170) 
On the holomorphic wave functionals, 
2x 6 
Az pl Ag] = nay [Ag] (20.171) 
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As we have mentioned before, one has to make a reduction of the Hilbert 
space by imposing gauge invariance on the states, i.e., by setting the generator 
F% to zero on the wave functionals. This amounts to 


6 k a a a } 
(p. eV 70.42 p[A2] = 0. (20.172) 


Consistent implementation of gauge invariance can lead to quantization 
requirements on the coupling constant k. For nonabelian groups G this is 
essentially the requirement of integrality of k based on the invariance of e** 
under homotopically nontrivial gauge transformations. We now show how 
this constraint arises in the geometric quantization framework. Consider first 
the te theory on ©’ = S?. The group of gauge transformations G = 
{g(x): S? — G}. Obviously Ho(G) = I2(G) = 0 and 1,(G) = I13(G) = Z. 
aaa! one has I7;(A/G) = 0 and IIz(A/G) = Z. The nontriviality 
of IT2(A/G) arises from the nontrivial elements of I7,(G). Therefore, consider 
a noncontractible loop C' of gauge transformations, 


C = g(z,A), 0<rA<1 
g(x,0) = g(z,1) = 1 (20.173) 


We can use this to construct a noncontractible two-surface in the gauge- 
invariant space A/ G: 

We start with a square in the space of gauge potentials parametrized by 
0 < A,0 < 1 with the potentials given by 


A(z,A,0) = (gAg-'— dgg')o + (1-a)A (20.174) 


The potential is A on the boundaries \ = 0 and 4 = 1 and also on o = 0. It is 
equal to the gauge transform AY of A at o = 1. Since A9 is identified with A in 
the quotient space, the boundary corresponds to a single point on the quotient 
A/G and we have a closed two-surface. This surface is noncontractible if we 
take g(x, \) to be a nontrivial element of [3(G) = Z. We can now integrate 
2 over this closed two-surface; for this calculation, we may even put A equal 
to zero and use A(x,\,0) = —odgg~!. We then have 6A = —dodgg™* — 
ad(égg~') —a|5gg~1, dgg—*]. Using this in the expression for 2 and carrying 
out the integration over o, we get 


[e=Z [Mesa 6007 
=75- | Ir(dag*)° 
= —~InkQlal (20.175) 


where, in the second step dgg~! = 0;gg~1dx'+0gg~ 1d); we include differen- 
tiation with respect to the spatial coordinates and with respect to the internal 
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coordinate A. Q{g] is the winding number (which is an integer) characterizing 
the class in J7,(G) = II3(G) to which g belongs; it is given by 


Qlol =-sr5 f Maso)’ (20.176) 


From general principles of geometric quantization we know that the integral 
of 2 over any closed noncontractible two-surface in the phase space must be 
an integer, see (20.58). Thus (20.175) and (20.176) lead to the requirement 
that k has to be an integer. This argument can be generalized to other choices 
of 2’. : 

The situation for an Abelian group such as U(1) is somewhat different. 
Consider the case of G = U(1) and with © being a torus S! x S!. This 
can be described by z = €; + T&2, where &1, 2 are real and have periodicity 
of & — &+ integer, and 7, which is a complex number, is the modular 
parameter of the torus. The metric on the torus is ds? = |dé, + Td&2|?. The 
two basic noncontractible cycles (noncontractible closed curves) of the torus 
are usually labeled as the a and @ cycles. Further the torus has a holomorphic 


one-form w with 
/ wot i ts) ap (20.177) 
a B 


Since w is a zero mode of Oz, we can parametrize Az as 


TW 


Az = 0; L 
3 ie 


a (20.178) 
where a is a complex number corresponding to the value of A; along the zero 
mode of 0,. This is the Abelian version of (17.60). 

For this space, I[o(G) = Z x Z, by virtue of gauge transformations gmin 
with nontrivial winding numbers m,n around the two cycles. Consider one 
connected component of G, say, Gmn. A homotopically nontrivial U(1) trans- 
formation can be written as gmmn = e e™™, where \(z, Z) is a homotopi- 
cally trivial gauge transformation and 


Be [fm / ‘(@-a) ih “(a -70)] (20.179) 


With the parametrization of Az as in (20.178), the effect of this gauge trans- 
formation can be represented as 

X ae 

a>a+m+ nr (20.180) 


The real part of x can be set to zero by an appropriate choice of ». (The 
imaginary part also vanishes when we impose the condition F,; = 0.) The 
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physical subspace of the zero modes is given by the values of a modulo the 
transformation (20.180), or in other words, 


ill 


Physical space for zero modes = C 
Cc 
———— 20.181 

Z+7Z ( ) 
This space is known as the Jacobian variety of the torus. It is also a torus, 
and therefore we see that the phase space C has nontrivial I7; and H?. In 
particular, [;(C) = Z x Z, and this leads to two angular parameters yy and 
ye which can be related to the phases the wave functions acquire under the 
gauge transformation g;,;. The symplectic two-form for the zero modes can 
be written as 


g= Fore | woAw 
~ 4 Imr Js Imr 
kr da A da 
a Ai 
ica (20.182) 


Integrating the zero-mode part over the physical space of zero modes C, we 
get 


i Q=kr (20.183) 
(& 


showing that k must be quantized as an even integer for U(1) fields on the 
torus due to (20.58). (The integrality requirement on 2 arises from the use 
of wave functions which are one-dimensional, i.e., sections of a line bundle. 
If we use more general vector bundles, this quantization requirement can be 
relaxed; however, the probabilistic interpretation of such wave functions is 
not very clear.) 

The symplectic potential for the zero modes can be written as 


mk (@ — a)(rda — Tda) © 


=— 20.184 
. 4 (Imr)? ( ) 
The polarization condition then becomes 
0  .«k(@—a)r 
— +i——> a0} 20.185 
E ane 4 (Imr)? | y ( ) 


with the solution 
= a area a 20.186 
yao |i FRA Fa) (20.186) 
where f(a) is holomorphic in a. Under the gauge transformation (20.180) we 
find 


ro NP re krn? 
oe eee py —| ae 


(20.187) 
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Under this gauge transformation A changes by dA, where 


wkn(ta — 7a) 
Amn = ‘Sa (20.188) 
The change in 7 should thus be given by exp(tAm,n)¥; requiring the trans- 
formation (20.187) to be equal to this, we get = “““*"” 


f(a+m-+nr) = exp ee - rikna| f(a) (20.189) 


x 


This shows that f(a) is a Jacobi O-function. On these, @ is realized as 
(2 Imr/k7)(0/0a) + a. The inner product for the wave functions of the zero 
modes is 


2Imr 4Imr 4Imr 


(f\g) = [exp |- mae - ul ae | fg (20.190) 


It is then convenient to introduce the wave functions 


tka 
r= [3] 10 
aka? 
= exp E = O(a) (20.191) 
On these functions, @ acts as 
ae IO 
Te ae (20.192) 


20.6.8 6-vacua in a nonabelian gauge theory 


Consider a nonabelian gauge theory in four spacetime dimensions; the gauge 
group is some compact Lie group G. We can choose the gauge where Ap = 0 
so that there are only the three spatial components of the gauge potential, 
namely, A;, considered as an antihermitian Lie-algebra-valued vector field. 
The choice Ap = 0 does not completely fix the gauge, one can still do gauge 
transformations which are independent of time. These are given by 


A; — Al = gAig™! — dig g™* (20.193) 
The Yang-Mills action gives the symplectic two-form as 
pi [ae bE? 6A} 


= -2 i: d°x Tr (5E; 6A;) (20.194) 
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where E? is the electric field 0) A%, along the Lie algebra direction labeled 
by a. The gauge transformation of E; is E; — gE;g~1. The vector field 
generating infinitesimal gauge transformations, with g ~ 1 + y, is thus 


6 “) 
= jaan Dip)? — 1 9]°——— 
ga— [be [Dee sig + Ba ol 55 (20.195) 
This leads to 
igh = —6 i dx [—(Diy)* £2] (20.196) 
The generator of time-independent gauge transformations is thus 
G(y) = —- i) d°x (Diy)*E? (20.197) 


For transformations which go to the identity at spatial infinity, G(y) = 
f yp? G?, G° = (D,E;)*. G* = 0 is one of the Yang-Mills equations of motion; 
it is the Gauss law of the theory. In the context of quantization, this is to be 
viewed as a condition on the allowed initial data and enforces a reduction of 
the phase space to gauge-invariant variables. 

As discussed in Chapter 16, the spaces of interest are 


ug {sec of gauge potentials a} (20.198) 
G. = {sec of gauge transformations 9(x): R® > G 


such that g — 1 as |x| > oo} (20.199) 


The transformations g(x) which go to a constant element g.. # 1 act as a 
Noether symmetry. The states fall into unitary irreducible representations 
of such transformations, which are isomorphic to the gauge group G, up to 
G,-transformations. The true gauge freedom is only G,. The physical config- 
uration space of the theory is thus C = A/G,. In Chapter 16, we also noted 
that G, has an infinity of connected components so that 


+oo 


C= Ye © Gig (20.200) 
Q=—co 


Q is the winding number characterizing the homotopy classes of gauge trans- 
formations. The space of gauge potentials A is an affine space and is topologi- 
cally trivial. Combining this with Io(G,.) = Z, we see that the configurations 
space has noncontractible loops, with I;(C) = Z. Our general discussion 
shows that there must be an angle 6 which appears in the quantum theory. 
We can see how this emerges by writing the symplectic potential. 
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The instanton number v[A] for a four-dimensional potential is given by 


y[A] = — fas Tr (Fur Fag) 4”? 


gon 
1 apa ijk 
= Ta / d‘x E? FS. €%* (20.201) 


a 
The density in the above integral is a total derivative in terms of the potential 
A, but it cannot be written as a total derivative in terms of gauge-invariant 
quantities. v[A] is an integer for any field configuration which is nonsingular 
up to gauge transformations. It is possible to construct configurations which 
have nonzero values of v and which are nonsingular; these are the instantons, 
also considered briefly in Chapter 16. 

We may think of configurations A(x, x4) as giving a path in A with 24 
parametrizing the path. v[A] can be written as 


=p Kidl 


K[A] = ‘i Br FS 5Azes* (20.202) 


The integral of the one-form K around a closed curve is the instanton number 
vy and is nonzero, in particular, for the loop corresponding to the instanton 
configuration. We can also see that this one-form is closed as follows: 


6K[A] = -2 / dx 5Tr [Fj~5 Ai] €9* 
=—4 / d°x Tr [(Dj;6Ax)5 Ai] €* 


ei / dx Tr [0,5Ax 5A; + (As, 6Ag]oAg] 
= (20.203) 


In the last step we have used the antisymmetry of the expression under 
permutation of 6’s, cyclicity of the trace, and have done a partial integration. 
We see from the above discussion that K[A] is a closed one-form which is not 
exact since its integral around the closed curves can be nonzero. 

The general solution for the symplectic potential corresponding to the 
symplectic two-form in (20.194) is thus of the form 


AS " Bx B26A9 +6 KIA] (20.204) 


Use of this potential will lead to a quantum theory where we need the pa- 
rameter 0, in addition to other parameters such as the coupling constant, 
to characterize the theory. The potential A in (20.204) is obtained from an 
action 
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1 
a ih d‘x Fa, Fe + 6 v[Al (20.205) 


Thus the effect of using (20.204) can be reproduced in the functional integral 
approach by using the action (20.205). Since the relevant quantity for the 
functional integral is exp(iS), we see that 6 is an angle with values 0 < @ < 2z. 
Alternatively, we can see that one can formally eliminate the @-term in A by 
making a redefinition Y — exp(i0A)W, where 


a 2 43 
Aes / Tr (aaa + = ) (20.206) 


Notice that 27 A is the Chern-Simons action (20.151) for k = 1. A is not invari- 
ant under homotopically nontrivial transformations. The wave functions get 
a phase equal to e’?@ under the winding number Q-transformation, showing 
that 6 can be restricted to the interval indicated above. This is in agreement 
with our discussion after equation (20.52). 


20.6.9 Current algebra for the Wess-Zumino-Witten (WZW) 
model 


The WZW action was introduced in Chapter 17 in the context of evaluating 
the two-dimensional Dirac determinant. One can think of the WZW action 
in its own right as defining a field theory in two dimensions. For this one 
uses the Minkowski signature; the dynamical variables are group-valued fields 
g(z°, x). The action is given by 


k 


k 
ee 2 —lop —1 83 , 
S(9) = — 5 [Pe W(Qua0*0"ag) + a= | TH(dag™*)* (20.207) 


The first term involves integration over the two-dimensional manifold M?;the 
second term, the Wess-Zumino (WZ) term, requires extension of the fields 
to include one more coordinate, say, s, and corresponding integration. We 
can take M° as a space whose boundary is the two-dimensional world, or we 
can take M? = M? x [0,1] with fields at s = 1 corresponding to spacetime. 
Different ways of extending the fields to s # 1 will give the same physical 
results if the coefficient k is an integer. This quantization requirement arises 
from the single-valuedness of the transition amplitudes or wave functions. 
This result was also shown in Chapter 17; we just note here that the WZ 
term, being a differential form, is not sensitive to the signature of the metric 
and so the argument presented in Chapter 17 will be valid in the Minkowski 
case as well. 

In the Minkowski coordinates we are using here, the Polyakov-Wiegmann 
identity becomes 


S(hg) = S(h) + S(g) — = / dx Tr(h~*d;hd;9 g~*)(n? + €7) (20.208) 


526 20 Elements of Geometric Quantization 


where 7’ = diag(1,—1) is the two-dimensional metric and e*? is the Levi- 
Civita tensor. By taking small variations, h ~ 1+0, 0 < 1, this identity gives 
the equation of motion 


(80-91) (809 97 * + Aig g"') =0 (20.209) 
This is also equivalent to ern ae 
(00+) (97809 —g7'A1g) =0 (20.210) 


There are two commonly used and convenient quantizations of this action 
which correspond to the equal-time and lightcone descriptions. There is a 
slight difficulty in obtaining the symplectic potential from the surface term 
resulting from time-integration. This is because the expression for the WZ 
term is written for a spacetime manifold which has no boundary, so that it 
can be the boundary of a three-volume. The variation of of the WZ term can 
be integrated to give 


6Iwz = - / _ Ir(699'T’) (20.211) 
M 


where I = dgg~. Reintegrating this over the parameter s, we get the form 
of the WZ term written on M? x [0,1], 


1 
Iwz= pall as [ Tr [0.997 "I? (20.212) 
An 0 M2 


We will use this form to identify the symplectic potential. 
First we consider the equal-time approach. The action, separating out the 
time derivatives, is 


k 
on) = aa i dx Tr(Sogg”*)? + Iwz (20.213) 
TJ M2 


If we vary this and look at the surface term from the integration over time, 
we get the symplectic potential as 


Oi -= / dx Tr(€Ip) + = i Tr(€I?) (20.214) 


Here € = 6gg™!, and the last term still has integration over s as well as x. 
Exterior derivatives are given by 


— 
bh, = 06 +61, —hé (20.215) 


Upon taking exterior derivatives, the second term in A becomes a total deriva- 
tive and can be integrated over s as follows. Using J? = dI, 
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sp frery= = f wler—eaaerer-te) 


— = | Tr [—d(é? 
us 
k 
== | Blas) +a,n) 
k 2 
=a / Tr [é771] (20.216) 
The symplectic two-form is now obtained as 
k 
a ~ ie fe oY [€7Io a 7; a E61o| (20.217) 


Notice that Jo must be considered as an independent variable, as is usually 
done in equal-time quantization. 

Consider a vector field V|(@) whose interior contraction has the effect of 
replacing € by 0. If we expand £ = dgg~! = (—it*)Eandx°, we can explicitly 
write 


Vi (0) = i, dx (Biel 5 (20.218) 


The action of V\(@) on g is to make the left translation Vi (0)g = (—it*)6%g. 
We also define 


V2(0) = dx f2°°9° (Ip agp. (20.219) 


61g 


which has the effect of replacing 6J by [0, Jo — )] upon taking a contraction 
with 2. The contraction of V = V; + V2 with 22 gives 


apa 
iy = -5 fae E | 


= —6Jo(6) (20.220) 


Thus V is a Hamiltonian vector field corresponding to Jo. 
Another vector field of interest is 


6 
W(6) = ‘| de (0,0 + FOS) (20.221) 
0 
This has the effect of replacing 6[p by 0,0 + [0,1]. Contraction with 2 gives 


ina—b fae [Lie 


= —6J;(8) (20.222) 


Thus W is a Hamiltonian vector field corresponding to J1. 
The currents of interest are, once again, 
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a k —-1 ‘ 
Ji = By (On99 ) (20.223) 


For the Poisson brackets, we find 


{J0(0),Jo()} = ~ivbJo(p) = tv f ae 15 (#19) 


k 
rE fe Tr (0, Io — ly 


II 


= Jo(0 Sis Oe ae (20.224) 
{4(0),n(e)} = iv fae tr (Esng) 

cone a0" 2 (20.225) 
{41(0), Ja(9)} = iw / fone (Fsne) 

= (20.226) 


In these equations (9 x y)* = f2°°@°y°. These can be combined to yield 
k 
{J+(8), J+ (9) } = J+ (8 x G) — F_AO%® (20.227) 
where J, = Jo+ J. The classical Hamiltonian is given by 
An 
H=— " dx [JOS + J2.J2] (20.228) 


In the lightcone quantization, we introduce coordinates u, v, 


1 1 
sien saee = ae) 
= oem 8y = (8 +81) (20.229) 


v2 v2 


This is different from (20.143), but given the structure of the equations of 
motion (20.209), these are easier. The action becomes 


k 
S= ae jf ®@.99-2.997) +Iwz (20.230) 
We take v as the analog of the spatial coordinate and consider evolution 


in u. The surface term for u-integration will arise from variations on the 
0,997 '-term. This gives 


k 
A=-= [ HEA99) += | _Trelaoo™")) (20.231) 
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Once again, the last term still involves the s-integration, but upon making 
another variation, we can integrate this as before and get 


k 
Q= / dv Tr [O,€ + 2€? I,] (20.232) 


The equation of motion is 0,f, = 0. The contraction of V;(6) as defined in 
(20.218) gives 


Pipi - ih dv Tr [200,£ + 20€I, — 2¢01,] 
k 
= = fw WOE +, HDI 


kI29* 
=—/ aw 
fe | ie | 


ves, (6) (20.233) 


The current J, is given by 


Ga= kK —l\a 
Jy = 4 (Ovgg) (20.234) 


The Poisson brackets are given by 
+ 
{Ju(0), Jo(@)} = tv, = f TH(SIo) 


k 
a= = ft [0,09 + Oyp — Iy6¢| 


-E [rw 5 | este" 


=J,(0 oy ie M fa yp (20.235) 


The algebra of currents given by (20.227), or the light-cone version 
(20.235), is an example of a Kac-Moody algebra. 
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Appendix:Relativistic Invariance 


A-1 Free point-particles and the Poincaré algebra 


In this appendix we shall briefly discuss the relativistic wave functions of 
free particles. The symmetry operations relevant to relativistic invariance 
are translations (in space and time) and Lorentz transformations (including 
spatial rotations). To obtain the wave functions, we need a Hilbert space 
on which we can realize translations and Lorentz transformations as unitary 
transformations. In other words, we need a unitary irreducible representation 
(UIR) of the algebra of translations and Lorentz transformations. The re- 
quirement of unitarity is clear since all physical observables generate unitary 
transformations on the Hilbert space in quantum mechanics. The qualifica- 
tion of irreducibility is a little more subtle, so we take a moment to recall 
that in nonrelativistic quantum mechanics we have a similar situation. For 
one-particle dynamics in one dimension, the relevant operators are £ and p 
with the commutation rules 


[z, 2] =0, Ip, p = 0, [z, Dp] =1 (A-1) 


We need a unitary representation of this algebra of observables. One way to 
represent the operators Z,p is as follows. We can consider complex-valued 
functions (x, p) with the normalization condition 


[ doae w(e,0)" (ap) =1 (A) 
The operators act by 
AA ie 
Dl ae 
tu= (iS +2) (A-3) 


Clearly, this gives the wrong quantum mechanics since we are specifying the 
wave functions as functions of x and p. We can also see that this representa- 
tion is reducible. In fact, we see that we can impose a condition 


Ob : 
mp 7° (A-4) 
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on the wave functions and still obtain a representation of (A-1). In particular, 
in this case, 


py =-iSy, ty =a (A-5) 
The normalization condition can now be taken as 
fav =n (A-6) 


This is the standard Schrédinger representation. Since we are able to ob- 
tain a representation on this smaller space of functions obeying the condition 
(A-4), the former representation (A-3) is reducible. The Schrodinger represen- 
tation can be shown to be irreducible; i.e., there is no smaller function space 
on which the algebra (A-1) can be realized. (Properly speaking, one should 
consider bounded operators obtained by exponentiation.) We thus see that 
quantum mechanics may be identified as a unitary irreducible representation 
(UIR) of the algebra of observables. (For the algebra (A-1), there is only one 
representation, the Schrodinger representation, up to unitary equivalence, a 
result due to Stone and von Neumann. With an infinite number of degrees 
of freedom, or on nonsimply connected spaces, there can be many UIR’s 
and the physical consequences can be different depending on which UIR one 
chooses. This is not an issue of immediate relevance to our discussion; we 
have discussed some of these issues in Chapters 12, 15 and 20.) 

Returning to the question of the relativistic point-particle, we consider the 
symmetry transformations in some more detail. The action for the motion of 
a relativistic point-particle is the mass m times the proper distance, or 


S= je Vi am | Sider” (A-7) 


The metric tensor in the above expression is the Minkowski metric with 
Cartesian components 709 = 1, nj; = —6;; and all other components being 
zero. The symmetries of the theory are clearly the symmetries of the proper 
distance ds = 4/n,,dr"dz”. We first consider the continuous symmetries 
which can be understood in terms of infinitesimal transformations. We write 


ge g'# =o 4 EH 7 (A-8) 


where €¥ is infinitesimal. The requirement that this be a symmetry transfor- 
mation of the proper distance gives 


O£° Of 
Now aa oe Nea a =0 (A-9) 


The solution to this condition is given by 


Co =a er (A-10) 
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where a4, w#” are constant parameters and wt” = —w’". w#” are the six pa- 
rameters of Lorentz transformations, w® being the relative velocities of the 
frames connected by the transformation (A-10). 6° = de7*w;, are the angles 
of spatial rotations and a“ are the parameters of translations. For any func- 
tion of x’, we can write the generators of the transformations immediately 


from (A-8, A-10). 


a of of 
5 i ee ee [BY a 
He) = OV aget = Mae + ON fv ac 
~ ( a" P,, — su Mus ) f (A-11) 
where 3 
P, = aa? My = Sey cae TpPu (A-12) 


(The factors of 7 are convenient for the sake of hermiticity; these operators 
will be interpreted later as physical quantities.) The commutation rules of 
these operators are easily worked out to be 


[Pus P,| =0 
[Muv, Pel a Naat, = Nov Pu) (A-13) 
[Muv, Mag] =1 (NuaMip mS vp Myo a NupMva _ NvaM yg) 


This algebra of translations and Lorentz transformations is the Poincaré al- 
gebra. Although we obtained this algebra by considering the infinitesimal 
transformations on a scalar function, it is of general validity. One can also 
characterize particles with spins in terms of this algebra. 

For example, for a vector-valued function A,(x), the transformation 
rule can be worked out by treating A,dz“ as a scalar; ie., dA,dz* = 
A,,(z’) dx’* ~ A,(x)dz". Explicitly, 


_ pe aan eee 
aE = Ox” ‘i Ore” 
= (iar — 5" Map Ay (A-14) 
Bb 
where 
_ Oo 
Fa = "ara 
Mag = (tePs ~ wala) + Sap 
(Sap )yx = —i(Nuat%p — Mons) (A-15) 


Sap is to be thought of as a 4 x 4-matrix, a, 3 specifying the type of Lorentz 
transformation we are interested in and p,v specifying the matrix elements. 
Equation (A-15) is in matrix notation, i.e., expressions likerdg i tas — 
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zgP, are proportional to the identity matrix 67. One can easily verify that 
P,, Mag in (A-15) obey the same commutation rules as in (A-13), with 
matrix multiplication understood for Sag. Sag is the spin contribution to the 
Lorentz generators. 

For a constant vector k,, the Lorentz transformation is generated en- 
tirely by S,,. The finite transformation is given by the composition of many 
infinitesimal transformations as 


: i wo . 
i (- 55756) | . 


py 
= lexm(—ju"*So9| ky z (A-16) 
bb 
= Ly Ky 
The matrix Li, obeys 
Le EE ae ine ; (A-17) 


and further det L = 1, L8 = 1. (These are sometimes further qualified 
as proper orthochronous Lorentz transformations. Improper transformations 
can have a determinant equal to —1; they can be understood in terms of 
composing the proper orthochronous transformations with discrete transfor- 
mations such as parity and time-reversal. The discrete transformations which 
are also symmetries of the proper distance will not be discussed here; they are 
most easily understood in the functional integral language and are explained 
in chapter 12.) 

The Poincaré algebra (A-13) is the basic algebra of observables for free 
point-particles. From our understanding of quantum mechanics, we can thus 
obtain free-particle wave functions by studying the unitary realizations of the 
Poincaré algebra. P,,, being the generator of translations, will be the four- 
momentum of the particle, Py = H being the Hamiltonian. M;; will generate 
spatial rotations and hence J‘ = Se%*M,;, is the angular momentum. Mo; = 
K; generate Lorentz transformations connecting frames of reference which 
are in relative motion; these are the so-called boosts. One can decompose the 
Poincaré algebra in terms of these components as follows: 


[Pi, Pj] = 0, [P;, H] =0 
[Feeka cae Eijne’ [Ji, H] =) 
[Ki,P;]=71 6H,  [Ki,H] =iP, 


(Ji; Jy) a eee (A-18) 
[J;, Kj] — a €ijhK* 
Eee] = a ha 
In arriving at these equations, it is useful to remember that e%* is the nu- 
merical Levi-Civita symbol, so that €7*¢#e> = §3a5kb _ gibgka, 
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A-2 Unitary representations of the Poincaré algebra 


We begin the construction of unitary realizations of the Poincaré algebra by 
noticing that the P, commute among themselves and so can be simultane- 
ously diagonalized. We can thus define a set of states, |p), depending on a 
four-vector p, such that, 


Py\p) = pulp) (A-19) 


Now, from the commutation rule (A-13), we can work out the finite Lorentz 
transformation of P,, as 
P, =Lw)i PB (A-20) 


(A-13) is the infinitesimal version of this result with P), = U(w)P,U7~"(w), 
i) exp(—$w"” Myv). For the eigenvalue four-momentum p,, we thus 
have p), = Lipy. 

From the transformation law for P,,, it is clear that P? is invariant under 
Lorentz transformations, and since the P’s are commuting operators, P? 
commutes with all P,, and M,,,; i.e., it is a Casimir operator for the Poincaré 
algebra. The possible representations can therefore be specified by the value 
of P?. The possibilities are 


1. P?>0,  say,,.P? =m? 
2 Ff =<0 
3. P? =0. 


P? > 0 will describe massive particles of mass m. (In fact the condition 
P? = m? may be taken as the definition of the mass m.) P? < 0 is unphysical, 
corresponding to propagation faster than light. P? = 0 will describe massless 
particles such as the photon. 

There is also another Casimir operator for the Poincaré algebra. This is 
given as W?, where W+ is defined as 


W# = &#¥°8 P Mag (A-21) 


W+# is called the Pauli-Lubanski spin vector. We can use W? to characterize 
the representations further. However, instead of analyzing the possible values 
of W? in general, we shall simply analyze the representations of interest and 
calculate W? for these. 

Given that p? = m? or zero, for the states |p), we have po = +./p? + m? 
(or po = +,/p"). It is easy to see from the Lorentz transformation of p, that 
L” do not change the sign of po. Therefore we further characterize represen- 
tations by the sign of po; for the physically interesting cases (with positive 
energy), we have 


p = 0, Po SS 0. (A-22) 


538 Appendix:Relativistic Invariance 


The states |p) are thus labeled by the three-emomentum p, with po = 
/p? +m? (or po = Vp? for the massless case). These states for all of p- 
space obeying the conditions (A-22) should give us a complete set of states. 
Because (A-22) are Lorentz-invariant, we choose a Lorentz-invariant measure 


of p-integration as 
dp 
dy.(p) = (an)3 


(with m = 0 for the massless case). The po-part of the integration with this 
measure is trivial; carrying out this integration, we get, instead of (A-23), 


Tong 6(p” — m”)O(pé) °¢ (A-23) 


dp 1 


(27r)3 Qu, (oe 


du(p) = 


where w, is the positive square root solution for po, wp = \/p*? +m?; we 
integrate over all of p. The completeness condition for the states |p) can be 


written as 
fo Sax wl =1 (A-25) 


Since P; is hermitian, we must have orthogonality of states of different val- 
ues of p. The orthonormality condition corresponding to the completeness 
relation (A-25), is 


(plp’) = (21)? 2up 5°) (p — p’) (A-26) 


A-3 Massive particles 


So far, we have only specified the action of the momentum operators P,,. We 
must now specify how Lorentz transformations act on |p). For this we follow 
the procedure of Wigner. The strategy is to go to a special frame, suitably 
chosen, construct a representation there and bring it back to a general frame 
by appropriate Lorentz transformations. To begin with we must thus choose 
a special p,, obeying (A-22). For the massive case, which we shall discuss first, 
we can take 


Pu = pl?) = (m,0,0,0) (A-27) 


This corresponds to the rest frame of the particle. There is a special Lorentz 
transformation L(wo) = B(p), which gives p,, from po), ter 


a = Bp), (A-28) 
Actually there are many choices for B. Explicitly, one choice is given by 


By = ov — Pat mm)(e" + mi") , pun” 


aed te m(po +m) m Coe) 


where n,, is a fixed vector with no = 1, nj = 0. 
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There are still some transformations we can do in the rest frame. The 
transformations L(w) such that L(w)p) = p\ form the little group or the 
isotropy group of p). (Here (Lp), = als we use an obvious matrix 
notation.) In our case, it is evident that these are the spatial rotations R(w) = 
L(w), with w® = 0. (Clearly such rotations are an ambiguity in the choice of 
B also.) Unitary representation of these rotations is well known; it is standard 
angular momentum theory. A representation is characterized by the highest 
value, denoted s, of a component of angular momentum, say, J3. There are 
(2s + 1) states and we have the transformation rule 


U(8) |s,n) =>) Din (8) |s,n') (A-30) 
a 
Here 6 = 1ekw,,. The D'°), (9) are the standard Wigner D-matrices of angu- 
lar momentum theory; they obey the unitarity condition D'D = 1, in matrix 
notation. We may write them out as D'2),(8) = (s,n\e"J«|5,n’) where J, are 
angular momentum generators and |s,n) are the spin-sangular momentum 
states. 

The crucial observation is that the representation for the full Poincaré 
group can be obtained on states of the form |p, s,n). In other words, a basis 
for the Hilbert space is given by products of the form |p) @|s,n), where |s, 7) 
provide a unitary representation of the little group, in this case, rotations of 
p). The action of a general Lorentz transformation can be obtained from 
the action of rotations as in (A-30). For this one constructs, for every Lorentz 
transformation, a pure spatial rotation, called a Wigner rotation, given by 


Rw) = B-*(L(w)p) L(w) B(p) (A-31) 


Since R(Oy)p© = p, we see that the combination on the right-hand side 
of equation (A-31) is indeed a pure rotation. Thus for every w#”, we can 
associate rotation angles 64,, given by (A-31). The action of a general Lorentz 
transformation with parameters w”” can then be defined as 


U(w)|p, 8, n) a ae De (Ow (p, w)) |Lp, 8, n’) (A-32) 


Since the action of rotations on |s,7) is unitary, i.e., DID = 1, we see that 
(A-32) defines a unitary realization of Lorentz transformations. This can be 
explicitly checked with the scalar product (A-26). A representation such as 
(A-32) is called an induced representation since the action of Lorentz trans- 
formations is induced from the action of rotations. 

The nature of massive one-particle states is now clear. They are char- 
acterized by the spin s with (2s +1) polarization states. s determines the 
D-matrices to be used. The action of translations on such states is given 
by e-*P*, which is simply a factor e~*?* on the momentum eigenfunctions. 
Lorentz transformations, including rotations, act on the states as in (A-32). 
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It is instructive to work out the Wigner rotation explicitly for an infinites- 
imal Lorentz transformation. From (A-31) 


Rx1-—B1wB + (6B ") B (A-33) 


Working out the components, we find that 
; ; a 1 : 
RO aor ice Ocal i-to-i) + -+ (A-34) 


This identifies the Wigner rotation parameter as 
ijk ra = 


The Pauli-Lubanski spin operator, for pe? ) reduces to the angular mo- 
mentum operator, W° = 0, W* = mJ*. This explains why W* is called the 
spin operator. W? is given by m? s(s+1) for the representations appropriate 
to massive particles. 


A-4 Wave functions for spin-zero particles 


For a particle of spin zero, since s = 0, we do not have any nontrivial D- 
matrices for the transformations. The action of a finite translation by a4 on 
the state |p) is given by 


ee bye (p) (A-36) 


This shows that the z-space wave function for a particle of momentum p can 
be taken as 


p(x) = (z|p) =e", (A-37) 


sincer~> r+a is tepnaduaed by the action of e~*?’? as in (A-36). (Here px 
denotes p-z = p°x° — p-z.) w(x) is a Lorentz scalar. The scalar product 
for the z-space wave functions is taken as 


/ dx [tp(2)* (iby (2) — (iovp(x)*)vpr(w)] = 2p0(2n)25 (p — p') 


(A-38) 
The choice is dictated by the requirement of consistency with the normaliza- 
tion condition (A-26). The scalar product 


(1\2) = ‘ dar [Wt (Gove) — (iow) val (A-39) 


is easily checked to be Lorentz-invariant. 
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In our calculations in text, for simplicity of interpretation of creation and 
annihilation of particles, we have considered the particles to be in a cubical 
box of volume V = L°, with the limit V — oo taken at the end of the 
calculation. (In this limit, we will recover the full Lorentz symmetry as well.) 
In this case, the wave functions for a particle of momentum p can be taken 
as or 


f 2WpV 


We shall use periodic boundary conditions on the wave functions. Ultimately, 
of course, physical results should not be sensitive to the boundary behavior 
in the limit of V — oo; so this convenient choice should be fine. With periodic 
boundary conditions, i.e., up(z + L) = up(x) for translation by L along any 
spatial direction, the values of p are given by 


Up(Z) = (A-40) 


27n,; 
{= — A-41 
p Z (A-41) 


(n1,2,n3) are integers. The wave functions u,(x) obey the orthonormality 
relation 


/ dx [ut (éOptip) — (i0puS)ty'] = bpp (A-42) 

V 

where 6p,’ denotes the Kronecker 6’s of the corresponding values of n,’s, i.e., 
5p,p! a On, m4 Ong nt, Ong,nt, (A-43) 


In the limit of V — oo, we have 

3 

Op,p! — eae 63) (p — p') 
V 
d°p 

ee A-44 

= ‘i 4 (2m)8 (A-44) 
P 
The wave functions 7, (or up) are obviously solutions of the equation 


= (Vim (A-45) 


The differential operator on the right-hand side is not a local operator; it has 
to be understood in the sense of 


/av? ome? f (2) = [ok sor PVF mes) — (A-46) 


where 


eRe f(p) (A-47) 


f(z) = a a 
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One can, of course, define a local differential equation whose solutions are 
the wave functions (A-40). It is the Klein-Gordon equation 


(O + m?)p =0 (A-48) 


where 1 is the d’Alembertian operator, 0 = 0,0" = (00)? — V?. Equation 
(A-48), however, has, in addition to (A-40), solutions‘ofthe form e~*?* with 
po = —1\/p* +m?. At the level of one-particle wave functions, it is difficult 
to interpret such solutions. However, in quantum field theory, they can be 
interpreted consistently and the Klein-Gordon equation becomes the basis 
for the discussion of spin-zero particles. 


A-5 Wave functions for spin-5 particles 


For a spin-5 particle, s = 5; we have two spin states. Translations act as in 
(A-36) and the wave functions have two components, viz., 


Vp,r(z) = (z\|p, s = 37) (A-49) 


(r = 1,2.) There are now nontrivial D-matrices in the transformation law. In 
this case, they are the 2 x 2 rotation matrices 


(3) (6) = exp (Zor) (A-50) 


where o, are the Pauli matrices 


na(22). oe(2 gs ae(h 8) om 


The wave functions are solutions to the equation 


(Otel) AE te (a9 


For many examples of spin-5 particles of mass m, we also have antiparticles 
of the same mass. The antiparticle wave functions ¢,(x) obey equation (A- 
52), with ¢, in place of w,. One can then define a combined wave function 
W(x) by 


vr(z) ) 
Vicj=| 7, A-53 
= (Be a 
W(x) is a four-component column vector. It obeys the equation 
OW 
i aay, —V2 +m? (A-54) 


where 
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100 0 
ice. 0 O 

el) Casa) 
ei Cem 


This is one version of the Dirac equation for spin-5 particles, the so-called 
Foldy-Wouthuysen representation. As with spin-zero particles and the Klein- 
Gordon equation, one can seek a local differential equation for W(x). The local 
equation is the usual version of the Dirac equation. Instead of transforming 
(A-54) to a local form, it is easier to show that (A-54) follows from the Dirac 
equation. 


A-6 Spin-1 particles 


For spin s = 1, we have three components and the wave functions have the 
form 


bp,r(x) = (x|p,s = 1,r) = e-"??e,(p) (A-56) 


(r = 1,2,3.) €, transforms as a vector under rotations, i.e., infinitesimally 
O€, = WyKeE (A-57) 


€, behaves like the spatial components of a four-vector. A local spin-1 analog 
of the Klein-Gordon equation might look like 


(0+ m?)y,(x) = 0 (A-58) 


The functions (A-56) are evidently solutions to (A-58) for any ¢,(p). This 
equation is, however, not manifestly covariant since only spatial components 
of a four-vector are involved. A local, manifestly covariant equation would be 


(+m), (x) =0 (A-59) 


(4 = 0,1, 2,3.) However, w, has one more component, namely wo, more than 
we need. Thus if one would like to have a local manifestly covariant equation, 
one can use (A-59), but must impose additional constraints on w,, to eliminate 
the unwanted degree of freedom. Alternatively, one may choose an equation 
different from (A-59) with extra symmetries which help us to eliminate the 
unwanted degree of freedom. These extra symmetries are gauge symmetries. 
They arise, from the particle point of view, because of the mismatch of the 
number of physical spin states, namely, (2s+1), and the number required for a 
suitable Lorentz vector or tensor in terms of which a local manifestly covariant 
equation can be constructed. (Gauge symmetries also have a deep geometric 
interpretation, as discussed in text.) Gauge symmetries are required for all 
spins s > 1. 
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A-7 Massless particles 


Although we derived the Poincaré algebra as the symmetry of the action for a 
massive point-particle, it holds for a massless particle as well. This can be seen 
by considering the symmetry of the equations of motion rather than the action 
or by noting that massless particles obey ds = 0. (They. follow null geodesics 
on a general spacetime.) Thus massless particles are also described by the 
UIR’s of the Poincaré algebra. In this case, we have p? = 0 and po > 0. We 
do not have the possibility of going to the rest frame of the particle. A general 
solution to p? = 0, po > 0 can be constructed by Lorentz transformations of 
a special vector po = (1,0,0,1). In our discussion of massive particles, we 
considered the transformations which left pr” = (m,0,0,0) invariant, viz., 
rotations. From the action of the rotations, via the use of Wigner rotations, 
we could obtain the action of a general Lorentz transformation on the states. 
One can do a similar construction for massless particles by considering the 
transformations which leave the vector py = (1,0,0, 1) invariant or in other 
words the isotropy group of this vector. Infinitesimally, these transformations 
are given by those w4” which obey w'” pl” = 0. One can check that a general 


Lorentz transformation which preserves po is of the form 


U wi + i (w® (Mor + Miz) + w°*(Mo2 + Mo3) + w'*Mi2) 
1 + i(w™T, +w™T> + w*My2) (A-60) 


T; = Mo; + Miz, i = 1,2, and M2 generates rotations around the direction 
of the spatial momentum p, in this case the z-axis. The commutation rules 
among these operators are 


[Mi2, Tz] = +Ti, [T,,T_] =0 (A-61) 


where Ty, = T; +17. We can also check that for the special choice of po, 
W? is given by 
W?2 =T.T_ (A-62) 


One class of states and representations is obtained by 
T+\p, r) = 0, figs Ip, d) = 0, Mia\p, r) = Alp, r) (A-63) 


This has W? = 0. In this case, since both W? and p? are zero, and W-p =0 
from (A-21), so we must have W¥ proportional to p”. In fact, WY = Ap". 
This equation gives an invariant definition of \. We may write 


N= = (A-64) 


A is called the helicity of the particle. Since Mj2 generates rotations, in- 
variance under w’? — w1? + 27 gives \ = 0, +1, ..., for single-valued 
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representations and \ = +3, a 3 ..., for the double-valued representations. 
The photon (A = +1) and the graviton (\ = +2) are examples of the realiza- 
tion of these representations. Just as in the massive case, one can construct 
a suitable By such that p, = By) and Wigner rotations and the explicit 
realization of a general Lorentz transformation on these states. We do not 
discuss these matters, since it is a little easier for these cases to obtain the 
wave functions and the transformations by solving local equations of motion 
from a field theoretic approach. 

One can also construct representations for which T; are not represented 
by zero. However, such representations have an infinite number of polarization 
states and do not seem to be of any physical significance. 


A-8 Position operators 


We have not discussed the position yet as an operator to be included in the 
algebra of observables. From the point of view of obtaining the representa- 
tions, this does not make much difference, x“ appear explicitly in the z-space 
wave functions. However, x“ is not appropriate as a position operator. For 
example, z° denotes time. It is, in the case of point-particles, just a vari- 
able parametrizing the path of the particle; to consider it as an operator 
would lead to difficulties of interpretation. Newton and Wigner have defined 
a proper notion of position operator and calculated it for various cases. We 
shall not discuss this in detail. For the case of massive spin-zero particles, 
one can see quite easily that x is not appropriate. It is not self-adjoint with 
the scalar product (A-26). A modification which gives a self-adjoint operator 
is 


2=iVp — (A-65) 
This is appropriate in the sense that it coincides with the position operator of 
nonrelativistic theory for small velocities; it is the center of mass for localized 
wave packets. Equation (A-65) is the Newton-Wigner (NW) operator for this 
case. For particles with spin, the NW position operator has spin-dependent 
terms in general. 


A-9 Isometries, anyons 


It is interesting to follow the logic of the previous sections, viz., of under- 
standing the one-particle states in terms of representations of the symmetry 
algebra, in some unusual situations. As an example, let us consider free parti- 
cle motion in a spacetime which is not necessarily flat Minkowski space. The 
action is given by S = —m f[ \/g,,dxdx”, where g,, is the metric tensor. 
The requirement that x4 — x’ + €* be a symmetry gives 
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Of oe* @ OG uv 

WY — A-66 
Gu + Wage + a =0 (A-66) 
We can write this equation as 


where 
Vey ad OnEy a | eet 


OGuv Ogpv Og B 
re 1 afk Lid } Lad A- 
py = 29 ( 0x8 Ort Ox” (A-68) 


I'™, is the Christoffel symbol and V,& is the covariant derivative of &,. 
Equation (A-67) is the Killing equation, the solutions €" give the isometries 
or transformations which leave the distance ds (and hence the particle ac- 
tion) invariant. The isometries will form a group, the isometry group, of the 
spacetime of metric ds? = g,,,dxz“dx”. From our general discussion of quan- 
tum mechanics, the Hilbert space for one-particle motion on such a spacetime 
will be given by a UIR of the isometry group. A simple concrete example is 
provided by anti-de Sitter spacetime, which has the metric 


ds* = dz — dz? — dz} — dz} + dz? = nyvdz"dz" + dz? (A-69) 


with 
Nite see ie (A-70) 


One can solve (A-70) explicitly in terms of local coordinates, valid in some 
coordinate patch, as 


zo = Rsint, za = Reost coshy 
(z1, 22,23) = costsinh y (cos@,sin@cosy,sin@siny) (A-71) 


Substitution of these in (A-69) will give a more standard four-dimensional 
way of writing the metric. Using the presentation of the space in terms of 
(A-69), (A-70) with the auxiliary variable z4, we see that the isometries are 
transformations which leave the quadratic form (n,,z“z” + z?) invariant. 
These are the pseudo-orthogonal transformations forming the group SO(3, 2) 
(and some discrete symmetry transformations). Thus UIR’s of SO(3, 2) will 
describe possible types of particle motion on anti-de Sitter spacetime. 

As another example, consider three-dimensional spacetime, with a metric 
of the form 

ds* = dt? — g,;dx'da’, i,j =1,2 (A-72) 

For simplicity the nontriviality of the metric is restricted to the spatial di- 
mensions. For flat space, 9; = ij, and the rotation 6x; = 0¢;;z/ is clearly 
a symmetry. The states are of the form |p, s) with the action of rotations in 
the rest frame, for which p,, = po, given by 
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U(8)|p, s) = e**? |p, s) (A-73) 


If we allow multivalued representations, s need not be an integer. Particles 

with any value of spin are generically called anyons. The quasi-particles rele- 

vant to the fractional quantum Hall effect are of this type. (For this system, 

the physics is essentially planar and so a two-dimensional description is rea- 

sonably accurate. Also a nonrelativistic approximation is quite adequate.) 
If space is a sphere of radius R, 


gijdz'da? = R?(d6? + sin?6dy?) (A-74) 


in terms of the usual angular coordinates 6, on the sphere. The isometries 
in this case are 


€(1) = (-sing, —cosy cot@), (2) = (cosy, —siny cot), — €3) = (0,1) 
(A-75) 
The generators L,, = —i€/,,)(0/0z*) obey the angular momentum algebra 
[Lyu, Lv) = teyvela, (u,v,a = 1,2,3). In other words, the isometry group 
of the two-dimensional sphere is SU(2), the angular momentum group. Uni- 
tary representations clearly require integer or half-odd-integer values of spin. 
Thus a spherical world cannot support anyons, at least with the action 
S = —m f ds. It is also interesting to investigate what kind of anyons are 

possible if the world is a two-dimensional torus of metric ds? = d6? + dy. 
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Gauge theory 
BRST symmetry, 195 
configuration space, 190 
Gauge transformations 
space of, 188, 358 
Gauss law, 187 
Gauss linking number, 451 
Gauss-Bonnet theorem, 337 
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Gell-Mann-Okubo mass formula, 259 


Generating functional 

free Dirac field, 53 

free scalar field, 47, 49 

functional integral, 105, 108 

interacting scalar field, 56, 58 

Yang-Mills theory, 192 
Geometric quantization 

S? and SU(2) algebra, 501 

and H’(M,R), 494 

and H?(M,R), 496 : 

Chern-Simons theory, 513 

coherent states, 498 

holomorphic polarization, 493 

polarization, 492 

prequantum operators, 491 
Glueballs, 462 
Goldberger-Treiman relation, 259 
Gravity, Einstein equation, 310 
Gribov ambiguity 

gauge fixing, 193 

nontrivial bundle, 332 

three dimensions, 334 


Haar measure, 475 

Hard thermal loops, 413 

Heisenberg equation of motion, 24 

Helicity, 542 

Higgs mechanism, 264 

Higgs particle, 272 

High temperature 
dimensional reduction, 433 
symmetry restoration, 434 

Hodge dual, 305 

Holonomy, 392 

Homotopy groups 
definition, 317 


exact sequence, 330, 362, 386, 456, 


459 
Lie groups, 321 
spheres, 319 


In-field operator, 72 

Instantons 
6-parameter, 522 
*t Hooft interaction, 370 
axial U(1) problem, 295 
baryon number violation, 371 
Bogomol’nyi inequality, 366 


fermion functional integral, 369 

field configuration, 364 

index theorem, 367 

instanton number, 326, 364, 521 
Isometry group, 544 


Jacobi 9-function, 520 
Jacobian variety, 519 


Kahler manifolds, 312, 504 
Killing equation, 544 
Killing equation 
conformal, 40 
Minkowski metric, 39 
Klein-Gordon equation, 2 
Klein-Nishina formula, 94 
Kubo formula, 419 
Kubo-Martin-Schwinger condition, 404 


Lamb shift, 156 
Landau damping, 421 
Landau gauge, 194 
Landau pole, 155 
Landau-Ginzburg theory, 447 
Large N expansion 
chiral symmetry breaking, 464 
double line, 460 
factorization, 462 
Lie derivative, 306 
Linking number, 451 
Loop expansion, 127 
Lorentz transformation 
spinors, 4 
vectors, 533 


Mandelstam variables, 260 

Massive vector particle, 205 

Matsubara frequencies, 430, 432 
Mermin-Wagner-Coleman theorem, 240 
Metric, 308 

Minimal coupling, 81 

Mott scattering, 87 


Nielsen-Ninomiya theorem, 480 

Nonequilibrium phenomena 
Boltzmann equation, 426, 427 
Kadanoff-Baym equations, 425 
time-contour, 423 

Nonrenormalizable theories, 165 


One-particle irreducible diagrams, 122 
Optical theorem, 262 
Orientable manifold, 303 


Parallel transport 
gauge theory, 182 
gravity, 310 
Parity, 225 
Partition function 
fermions, 402 
functional integral, 429 
lattice gauge theory, 475 
scalar field, 400 
Path integral 
0-parameter, 363 
and 17;(C), 349 
and H?(C), 351 
evolution kernel, 339 
for QCD, 362 
Huygens’ principle, 348 
Schrodinger equation, 342 
Wess-Zumino term, 351 
Pauli-Lubanski vector, 535 
PCAC relation, 261 
Phase space 
definition, 18 
light-cone quantization, 37 
Liouville measure, 487 
Photon 
covariant propagator, 81 
mode expansion, 79 
polarization vectors, 79 
self-energy, 150, 155 
transverse propagator, 80 
Pion decay, 83, 95, 292 
Pion-nucleon interaction, 259 
Pion-pion scattering, 259 
Planck mass, 167 
Plasma oscillations, 411 
Poincaré algebra, 533 
Poincaré duality, 314 
Poincaré lemma, 304 
Point-particle action, 532 
Poisson bracket, 21, 485 
Poisson bracket 
light-cone quantization, 513 
WZW model, 526 
WZW model, light-cone, 527 
Propagator 
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fermion, 50 

fermions, thermal, 403 
ghost, 194 

gluon, 194 

gluon, one-loop, 206 

gluon, pinching term, 213 
scalar field, 45 

scalar, thermal, 401 

thermal, imaginary-time, 430 


Quantum chromodynamics 
A-parameter, 215 
0-parameter, 363, 520 
asymptotic freedom, 214 
axial U(1) problem, 293 
axial anomaly, 291 
chiral symmetry breaking, 255 
effective action for anomalies, 382 
path integral, 362 

Quark masses, 253 


Radiation gauge, 79 
Reduction formula, 60, 347 
Regulators 
anomalies, 279 
counterterms, 167 
definition, 133 
dimensional regularization, 207 
Pauli- Villars, 279 
Renormalization 
Z-factors for scalar field, 134 
Z-factors, 6m, for QED, 148 
Z-factors, Yang-Mills theory, 204 
BPHZ, 158 
counterterms, 157 
forest formula, 161 
QED, one-loop, 154 
recursion formula, 160 
Weinberg’s theorem, 164 
Renormalization group 
critical exponents, 174 
scalar field, 168 
Retarded functions 
and finite T, 418 
Retarded propagator 
and interacting field, 73 
scalar field, 44 
Rho meson decay, 99 
Ricci tensor, Ricci scalar, 310 
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Riemann curvature, 308 
Riemannian manifold, 308 
Rutherford scattering, 89 


Scalar field 
B-function, 172 
anomalous dimension, 172 
Euclidean propagator, 46 
Feynman propagator, 45 


generating functional, 47, 49, 56, 58 


partition function, 400 
retarded propagator, 44 
thermal propagator, 401 
Schrodinger field, 70 
Schwinger model, 395 
Schwinger-Dyson equations 
y* theory, 125 
general form, 124 
nonequilibrium, 424 
Sigma models 
coset spaces, 247 
functional integral, 114, 118 
principal chiral model, 247 
Skyrmions 
and large N, 467 
baryon number, 468 
spin and flavor, 472 
topological current, 467 
Slavnov-Taylor identities, 202 
Spin-statistics theorem, 15, 31 
Spontaneous symmetry breaking 
chiral currents, 260 
currents, 251 
effective action, 249 
Goldstone’s theorem, 242 
PCAC, 261 
Standard model 
C'P-violation, 275 
W, Z, masses, 272 
action, 270 
baryon number violation, 371 
CKM matrix, 273 
mixed anomaly, 381 
particle content, 269 
symmetry restoration, 437 
Weinberg angle, 271, 272 
Yukawa terms, 272 
Stokes’ theorem, 304 
Stone-von Neumann theorem, 532 
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Strong CP-problem, 364 
Structure factor, 92 
Symmetries 

Goldstone realization, 218, 235 

of QCD, 254 

Wigner realization, 218 
Symmetry restoration 

high T, 434 

standard model, 437 
Symplectic structure, 19, 483 


Thomson cross section, 87, 94 
Time-reversal, 228 
Topological manifold, 297 
Torsion tensor, 308 


Uehling potential, 156 

Unitarity 
bound for cross section, 263 
effective theory, 264 
general relation, 262 
nonrenormalizable theory, 166 
optical theorem, 262 

Unitary gauge, 265 


Vacuum diagrams, 64, 121 
Vacuum polarization, 150, 155 
Variational principle, 489 
Vector fields 

definition, 298 

Hamiltonian, 485 
Veneziano-Witten formula, 294 
Vertex functions, 59 
Vortices 

Zn, 453 

Abrikosov-Nielsen-Olesen, 447 


Ward-Takahashi identity 

O(N) theory, 221 

BRST symmetry, 200 

low-energy theorem, 230 

QED, 155, 222 

Yang-Mills effective action, 203 
Wave function 

Chern-Simons theory, 515 

scalar field vacuum, 345 
Weil homomorphism, 317, 335 
Weinberg angle, 271, 272 
Weinberg’s theorem, 164 
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Wess-Zumino condition, 286 definition, 184 
Wess-Zumino-Witten model lattice, strong coupling, 477 
action, 388 Winding number, 319, 321, 359 


canonical two-form, 525 
canonical two-form, light-cone, 527 Yang-Mills theory 


Dirac determinant in 2 dim., 394 @-function, 214 
Kac-Moody algebra, 527 action, 181, 326 
level number, 389 BRST symmetry, 195 
Poisson brackets, 526 functional integral, 192 
Poisson brackets, light-cone, 527 Yukawa interaction 
Polyakov-Wiegmann identity, 389, pion-nucleon, 259 

395 standard model, 272 


Wick’s theorem, 60 
Wigner D-functions, 245, 472, 476,537 Zero-point energy 


Wigner rotation, 537 Dirac field, 29 
Wilson lattice action, 474 Lorentz invariance of vacuum, 27 
Wilson loop operator scalars, 12 


area law, 445 Zimmerman’s forest formula, 161 
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